Patents, Reports, and Student Theses
This thesis investigates the seasonal predictive capabilities of Neural Radiance Fields (NeRF) applied to satellite images. Focusing on the utilization of satellite data, the study explores how Sat-NeRF, a novel approach in computer vision, performs in predicting seasonal variations across different months. Through comprehensive analysis and visualization, the study examines the model's ability to capture and predict seasonal changes, highlighting specific challenges and strengths. Results showcase the impact of the sun on predictions, revealing nuanced details in seasonal transitions, such as snow cover, color accuracy, and texture representation in different landscapes. The research introduces modifications to the Sat-NeRF network. The implemented versions of the network include geometrically rendered shadows, a signed distance function, and a month embedding vector, where the last version mentioned resulted in Planet-NeRF. Comparative evaluations reveal that Planet-NeRF outperforms prior models, particularly in refining seasonal predictions. This advancement contributes to the field by presenting a more effective approach for seasonal representation in satellite imagery analysis, offering promising avenues for future research in this domain.
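To make the month-embedding idea concrete, the sketch below conditions a toy NeRF-style MLP on a learned per-month vector. This is a minimal illustration under assumed dimensions and names, not the actual Sat-NeRF or Planet-NeRF architecture.

```python
import torch
import torch.nn as nn

class MonthConditionedNeRF(nn.Module):
    """Toy NeRF-style MLP conditioned on a learned per-month embedding."""
    def __init__(self, pos_dim=63, month_dim=4, hidden=256):
        super().__init__()
        self.month_embedding = nn.Embedding(12, month_dim)  # one vector per month
        self.mlp = nn.Sequential(
            nn.Linear(pos_dim + month_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 4),  # RGB + density
        )

    def forward(self, encoded_pos, month_idx):
        m = self.month_embedding(month_idx)                 # (B, month_dim)
        out = self.mlp(torch.cat([encoded_pos, m], dim=-1))
        return torch.sigmoid(out[..., :3]), torch.relu(out[..., 3])  # rgb, sigma
```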
@mastersthesis{diva2:1841942,
author = {Ingerstad, Erica and Kåreborn, Liv},
title = {{Planet-NeRF:
Neural Radiance Fields for 3D Reconstruction on Satellite Imagery in Season Changing Environments}},
school = {Linköping University},
type = {{LiTH-ISY-EX--24/5631--SE}},
year = {2024},
address = {Sweden},
}
While markerless motion capture provided acceptable accuracy, no clear patterns emerged regarding the individual effects of surface properties on technique. This is most likely due to limitations such as the sample size, the lack of a standardized set of players across facilities, and limited control over player behavior. However, analyzing one individual's motion capture data across surfaces showed potential for distinguishing turning styles based on facility parameters.
The method in this thesis demonstrates the potential of markerless motion capture for injury prevention research in football. Despite inconclusive results on the individual facility parameter effects, the ability to distinguish player styles across surfaces suggests valuable future directions for investigating personalized risk factors and optimizing playing surfaces. Further research with larger, more diverse samples and a broader set of biomechanical and facility features could provide deeper insight into injury prevention strategies.
@mastersthesis{diva2:1848290,
author = {Rommel, Kaspar},
title = {{Influence of artificial turf on football technique using motion capture and 3D modelling}},
school = {Linköping University},
type = {{}},
year = {2024},
address = {Sweden},
}
Harness racing horses are exposed to a high workload and are consequently at risk of joint injuries and lameness. In recent years, the interest in applications to improve animal welfare has increased, and there is a demand for objective assessment methods that can enable early and robust diagnosis of injuries.
In this thesis, experiments were conducted on video recordings collected by a helmet camera mounted on the driver of a sulky. The aim was to take the first steps toward equine gait analysis by investigating how semantic segmentation and 3D reconstruction of such data could be performed. Since these were the first experiments made on this data, no expectations of the results existed in advance.
Manual pixel-wise annotations were created on a small set of extracted frames and a deep learning model for semantic segmentation was trained to localize the horse, as well as the sulky and reins. The results are promising and could probably be further improved by expanding the annotated dataset and using a larger image resolution. Structure-from-motion using COLMAP was performed to estimate the camera motion in part of a video recording. A method to filter out dynamic objects based on masks created from predicted segmentation maps was investigated and the results showed that the reconstruction was part-wise successful, but struggled when dynamic objects were not filtered out and when the equipage was moving at high speed along a straight stretch.
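As an illustration of the dynamic-object filtering step, predicted segmentation maps can be turned into the per-image masks COLMAP accepts during feature extraction, where zero-valued (black) pixels are ignored. The class ids and paths below are hypothetical.

```python
import numpy as np
from PIL import Image

DYNAMIC_IDS = {1, 2, 3}  # hypothetical class ids for horse, sulky and reins

def to_colmap_mask(segmentation_png, out_path):
    seg = np.array(Image.open(segmentation_png))
    dynamic = np.isin(seg, list(DYNAMIC_IDS))
    # COLMAP skips features where the mask is zero, so black out dynamic objects
    Image.fromarray(np.where(dynamic, 0, 255).astype(np.uint8)).save(out_path)

# Expected layout: masks/<image_name>.png, then e.g.
#   colmap feature_extractor --ImageReader.mask_path masks/ ...
```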
Overall, the results are promising, but further development is needed to ensure robustness and to conclude whether data collected by the investigated helmet camera configuration is suitable for equine gait analysis.
@mastersthesis{diva2:1729598,
author = {Hult, Evelina},
title = {{Toward Equine Gait Analysis:
Semantic Segmentation and 3D Reconstruction}},
school = {Linköping University},
type = {{LiTH-ISY-EX--23/5539--SE}},
year = {2023},
address = {Sweden},
}
With over 53 million articles and 11 million images, Wikipedia is the largest encyclopedia in history. The number of users is equally significant, with daily views surpassing 1 billion. Such an enormous system requires task automation for its volunteers to be able to maintain it. When it comes to textual data, a machine learning-based system called ORES automates tasks such as article quality estimation and article topic routing. A visual counterpart system also needs to be developed to support tasks such as vandalism detection in images and a better understanding of the visual data of Wikipedia. Researchers from the Wikimedia Foundation identified a hindrance to implementing the visual counterpart of ORES: the images of Wikipedia lack topical metadata. Thus, this work aims to develop a deep learning model that classifies images into a set of topics, which have been pre-determined in parallel work. State-of-the-art image classification models and other methods to mitigate the existing class imbalance are used. The conducted experiments show, among other things, that: using the data that considers the hierarchy of labels performs better; resampling techniques are ineffective at mitigating imbalance due to the high label concurrence; sample-weighting improves metrics; and initializing parameters as pre-trained on ImageNet rather than randomly yields better metrics. Moreover, we find interesting outlier labels that, despite having fewer samples, obtain better performance metrics, which is believed to be due either to bias from pre-training or simply to more signal in the label. The distribution of the visual data of Wikipedia, as predicted by the models, is also presented. Finally, some qualitative examples of the model predictions on images are presented, demonstrating the ability of the model to find correct labels that are missing in the ground truth.
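For context, the sample-weighting mentioned above can be realized by scaling the positive term of the per-label binary cross-entropy; a hedged PyTorch sketch with toy label counts, not necessarily the thesis's exact scheme.

```python
import torch
import torch.nn as nn

# Toy counts of how often each of C=4 topic labels occurs in the training set
label_counts = torch.tensor([50_000., 1_200., 300., 9_000.])
n_samples = 60_000

# Up-weight positives of rare labels so they contribute comparably to the loss
pos_weight = (n_samples - label_counts) / label_counts
criterion = nn.BCEWithLogitsLoss(pos_weight=pos_weight)

logits = torch.randn(8, 4)                     # model outputs for 8 images
targets = torch.randint(0, 2, (8, 4)).float()  # multi-label ground truth
loss = criterion(logits, targets)
```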
@mastersthesis{diva2:1729493,
author = {Vieira Bernat, Matheus},
title = {{Topical Classification of Images in Wikipedia:
Development of topical classification models followed by a study of the visual content of Wikipedia}},
school = {Linköping University},
type = {{LiTH-ISY-EX--23/5538--SE}},
year = {2023},
address = {Sweden},
}
Image fusion is a technique that aims to combine semantic information from different source images into a new synthesized image that preserves the information from both. It can be useful in many different areas, such as reconnaissance, surveillance and medical diagnostics. A crucial aspect of image fusion is finding important features in the source images and preserving them in the fused image. A possible method to find and preserve the features is to utilize deep learning. This thesis trains and evaluates an unsupervised network on two new datasets created for the fusion of visual near infrared (VNIR) and long wave infrared (LWIR) images. Feature representations obtained from a pre-trained network are then incorporated into the loss function, and that model is trained and evaluated as well. Both deep learning models are compared with results obtained from a traditional image fusion method. The trained models performed well, although the traditional method performed better on dataset 1. The deep learning models performed better on dataset 2, which contained images captured in daylight and dusk conditions. The resulting fused images from the deep learning approaches demonstrated better contrast compared to the fused images obtained by averaging. The additional feature representations obtained from the pre-trained network did not improve the results on either dataset. An explanation could be that the loss function already helps to preserve the semantic information in the features.
@mastersthesis{diva2:1737202,
author = {Granqvist, Matilda},
title = {{Infrared and Visible Image Fusion with an Unsupervised Network}},
school = {Linköping University},
type = {{LiTH-ISY-EX--23/5540--SE}},
year = {2023},
address = {Sweden},
}
Point cloud registration with data measured by a photon-counting LIDAR sensor at large distances (500 m - 1.5 km) is an expanding field. Data measured from afar is sparse and has low detail, which can make the registration process difficult, and registering this type of data is fairly unexplored. In recent years, machine learning for point cloud registration has been explored with promising results. This work compares the performance of the point cloud registration algorithm Iterative Closest Point (ICP) with state-of-the-art algorithms, on data from a photon-counting LIDAR sensor. The data was provided by the Swedish Defense Research Agency (FOI). The chosen state-of-the-art algorithms were the non-learning-based Fast Global Registration and the learning-based D3Feat and SpinNet. The results indicated that all state-of-the-art algorithms achieve a substantial increase in performance compared to the Iterative Closest Point method. All the state-of-the-art algorithms utilize their calculated features to obtain better correspondence points and can therefore achieve higher performance in point cloud registration. D3Feat performed point cloud registration with the highest accuracy of all the state-of-the-art algorithms and ICP.
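For reference, the ICP baseline compared against can be run in a few lines with Open3D; a minimal sketch assuming PLY files and a hand-picked correspondence threshold suited to sparse long-range data.

```python
import numpy as np
import open3d as o3d

source = o3d.io.read_point_cloud("scan_a.ply")   # hypothetical file names
target = o3d.io.read_point_cloud("scan_b.ply")

result = o3d.pipelines.registration.registration_icp(
    source, target,
    max_correspondence_distance=0.5,             # generous threshold for sparse data
    init=np.eye(4),
    estimation_method=o3d.pipelines.registration.TransformationEstimationPointToPoint(),
)
print(result.transformation, result.fitness)
```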
@mastersthesis{diva2:1761482,
author = {Boström, Maja},
title = {{Point Cloud Registration using both Machine Learning and Non-learning Methods:
with Data from a Photon-counting LIDAR Sensor}},
school = {Linköping University},
type = {{LiTH-ISY-EX--23/5558--SE}},
year = {2023},
address = {Sweden},
}
Today the process of sorting second-hand clothes and textiles is mostly manual. In this master’s thesis, methods for automating this process as well as improving the manual sorting process have been investigated. The methods explored include the automatic prediction of price and intended usage for second-hand clothes, as well as different types of image retrieval to aid manual sorting. Two models were examined: CLIP, a multi-modal model, and MAE, a self-supervised model. Quantitatively, the results favored CLIP, which outperformed MAE in both image retrieval and prediction. However, MAE may still be useful for some applications in terms of image retrieval as it returns items that look similar, even if they do not necessarily have the same attributes. In contrast, CLIP is better at accurately retrieving garments with as many matching attributes as possible. For price prediction, the best model was CLIP. When fine-tuned on the dataset used, CLIP achieved an F1-Score of 38.08 using three different price categories in the dataset. For predicting the intended usage (either reusing the garment or exporting it to another country) the best model managed to achieve an F1-Score of 59.04.
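The retrieval side can be sketched with OpenAI's CLIP package: embed gallery and query images, normalize, and rank by cosine similarity. File names are placeholders, and the fine-tuned weights used in the thesis are not reflected.

```python
import torch
import clip
from PIL import Image

model, preprocess = clip.load("ViT-B/32")

def embed(paths):
    ims = torch.stack([preprocess(Image.open(p)) for p in paths])
    with torch.no_grad():
        e = model.encode_image(ims)
    return e / e.norm(dim=-1, keepdim=True)       # unit norm -> cosine similarity

gallery = embed(["garment1.jpg", "garment2.jpg", "garment3.jpg"])  # placeholders
query = embed(["query.jpg"])
scores = query @ gallery.T                        # one similarity per gallery item
best = scores.topk(k=2).indices                   # most similar garments first
```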
@mastersthesis{diva2:1763534,
author = {Hermansson, Simon},
title = {{Learning Embeddings for Fashion Images}},
school = {Linköping University},
type = {{LiTH-ISY-EX--23/5567--SE}},
year = {2023},
address = {Sweden},
}
In the area of Traffic Sign Recognition (TSR), deep learning models are trained to detect and classify images of traffic signs. The amount of data available to train these models is often limited, and collecting more data is time-consuming and expensive. A possible complement to traditional data acquisition is to generate synthetic images with a generative machine learning model. This thesis investigates the use of denoising diffusion probabilistic models for generating synthetic data of one or multiple traffic sign classes, when providing different amounts of real images for the class (or classes). In the few-sample method, the number of images used ranged from 1 to 1000, and zero images were used in the zero-shot method. The results from the few-sample method show that combining synthetic images with real images when training a traffic sign classifier increases the performance in 3 out of 6 investigated cases. The results indicate that the developed zero-shot method is useful if further refined, and could potentially enable the generation of realistic images of signs not seen in the training data.
@mastersthesis{diva2:1764694,
author = {Carlson, Johanna and Byman, Lovisa},
title = {{Generation of Synthetic Traffic Sign Images using Diffusion Models}},
school = {Linköping University},
type = {{LiTH-ISY-EX--23/5563--SE}},
year = {2023},
address = {Sweden},
}
This thesis explores the application of Contrastive Language-Image Pre-Training (CLIP), a vision-language model, in an automated video surveillance system for anomaly detection. The ability of CLIP to perform zero-shot learning, coupled with its robustness against minor image alterations due to its lack of reliance on pixel-level image analysis, makes it a suitable candidate for this application.
The study investigates the performance of CLIP in tandem with various anomaly detection algorithms within a visual surveillance system. A custom dataset was created for video anomaly detection, encompassing two distinct views and two varying levels of anomaly difficulty. One view offers a more zoomed-in perspective, while the other provides a wider perspective. This was conducted to evaluate the capacity of CLIP to manage objects that occupy either a larger or smaller portion of the entire scene.
Several different anomaly detection methods were tested with varying levels of supervision, including unsupervised, one-class classification, and weakly-supervised algorithms, which were compared against each other. To create better separation between the CLIP embeddings, a metric learning model was trained and then used to transform the CLIP embeddings to a new embedding space.
The study found that CLIP performs effectively when anomalies take up a larger part of the image, such as in the zoomed-in view, where some of the One-Class-Classification (OCC) and weakly supervised methods demonstrated superior performance. When anomalies take up a significantly smaller part of the image, as in the wider view, CLIP has difficulty distinguishing anomalies from normal scenes even using the transformed CLIP embeddings. For the wider view, the results again favored the OCC and weakly supervised methods.
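A sketch of the metric-learning step under an assumed triplet objective (the abstract does not specify the loss): a small projection head is trained so that embeddings of same-class frames move together and away from the other class.

```python
import torch
import torch.nn as nn

# Hypothetical head mapping 512-d CLIP embeddings to a better-separated space
projector = nn.Sequential(nn.Linear(512, 256), nn.ReLU(), nn.Linear(256, 128))
criterion = nn.TripletMarginLoss(margin=0.5)
optimizer = torch.optim.Adam(projector.parameters(), lr=1e-4)

def training_step(anchor, positive, negative):
    # anchor/positive: frames of the same class; negative: frame of the other class
    loss = criterion(projector(anchor), projector(positive), projector(negative))
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```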
@mastersthesis{diva2:1765573,
author = {Gärdin, Christoffer},
title = {{Anomaly Detection with Machine Learning using CLIP in a Video Surveillance Context}},
school = {Linköping University},
type = {{LiTH-ISY-EX--23/5564--SE}},
year = {2023},
address = {Sweden},
}
Detecting defects in industrially manufactured products is crucial to ensure their safety and quality. This process can be both expensive and error-prone if done manually, making automated solutions desirable. There is extensive research on industrial anomaly detection in images, but recent studies have shown that adding 3D information can increase the performance. This thesis aims to extend the 2D anomaly detection framework, PaDiM, to incorporate 3D information. The proposed methods combine RGB with depth maps or point clouds and the effects of using PointNet++ and vision transformers to extract features are investigated. The methods are evaluated on the MVTec 3D-AD public dataset using the metrics image AUROC, pixel AUROC and AUPRO, and on a small dataset collected with a Time-of-Flight sensor. This thesis concludes that the addition of 3D information improves the performance of PaDiM and vision transformers achieve the best results, scoring an average image AUROC of 86.2±0.2 on MVTec 3D-AD.
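The PaDiM core that the thesis builds on fits one Gaussian per patch position over features of normal images and scores test patches by Mahalanobis distance; a NumPy sketch under those assumptions (feature extraction omitted).

```python
import numpy as np

def fit_padim(train_feats, eps=0.01):
    """train_feats: (N, P, D) patch embeddings from N normal images."""
    N, P, D = train_feats.shape
    mean = train_feats.mean(axis=0)                      # (P, D)
    cov_inv = np.empty((P, D, D))
    for p in range(P):
        c = np.cov(train_feats[:, p, :], rowvar=False)
        cov_inv[p] = np.linalg.inv(c + eps * np.eye(D))  # regularized inverse
    return mean, cov_inv

def anomaly_scores(feats, mean, cov_inv):
    """Mahalanobis distance of each patch (P, D) to its position's Gaussian."""
    d = feats - mean
    return np.sqrt(np.einsum("pd,pde,pe->p", d, cov_inv, d))
```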
@mastersthesis{diva2:1766718,
author = {Bärudde, Kevin and Gandal, Marcus},
title = {{Industrial 3D Anomaly Detection and Localization Using Unsupervised Machine Learning}},
school = {Linköping University},
type = {{LiTH-ISY-EX--23/5569--SE}},
year = {2023},
address = {Sweden},
}
In synthetic aperture radar (SAR) and inverse synthetic aperture radar (ISAR), an imaging radar emits electromagnetic waves of varying frequencies towards a target and the backscattered waves are collected. By either moving the radar antenna or rotating the target and combining the collected waves, a much longer synthetic aperture can be created. These radar measurements can be used to determine the radar cross-section (RCS) of the target and to reconstruct an estimate of the target. However, the reconstructed images will suffer from spectral leakage effects and are limited in resolution. Many methods of enhancing the images exist and some are based on deep learning. Most commonly the deep learning methods rely on high-resolution ground truth data of the scene to train a neural network to enhance the radar images. In this thesis, a method that does not rely on any high-resolution ground truth data is applied to train a convolutional neural network to enhance radar images. The network takes a conventional ISAR image subject to spectral leakage effects as input and outputs an enhanced ISAR image which contains much more defined features. New RCS measurements are created from the enhanced ISAR image and the network is trained to minimise the difference between the original RCS measurements and the new RCS measurements. A sparsity constraint is added to ensure that the proposed enhanced ISAR image is sparse. The synthetic training data consists of scenes containing point scatterers that are either individual or grouped together to form shapes. The scenes are used to create synthetic radar measurements which are then used to reconstruct ISAR images of the scenes. The network is tested using both synthetic data and measurement data from a cylinder and two aeroplane models. The network manages to minimise spectral leakage and increase the resolution of the ISAR images created from both synthetic and measured RCSs, especially on measured data from target models which have similar features to the synthetic training data.
The contributions of this thesis work are firstly a convolutional neural network that enhances ISAR images affected by spectral leakage. The neural network handles complex-valued signals as a single channel and does not perform any rescaling of the input. Secondly, it is shown that it is sufficient to calculate the new RCS for much fewer frequency samples and angular positions and compare those measurements to the corresponding frequency samples and angular positions in the original RCS to train the neural network.
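The training objective can be sketched as measurement consistency plus a sparsity prior. Here the forward model is idealized as sampling the 2D FFT of the enhanced image at the measured frequency/angle positions; the thesis's actual RCS forward operator and weighting are not reproduced.

```python
import torch

def self_supervised_loss(x_enhanced, y_measured, sample_idx, lam=1e-3):
    """x_enhanced: (H, W) complex image proposed by the network.
    y_measured: (M,) complex RCS samples; sample_idx: flat spectrum indices."""
    Y = torch.fft.fft2(x_enhanced)          # idealized image -> measurement map
    y_new = Y.flatten()[sample_idx]         # evaluate only the measured samples
    data_term = (y_new - y_measured).abs().pow(2).mean()
    sparsity = x_enhanced.abs().mean()      # L1 prior favouring point-like scenes
    return data_term + lam * sparsity
```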
@mastersthesis{diva2:1767511,
author = {Enåkander, Moltas},
title = {{ISAR Imaging Enhancement Without High-Resolution Ground Truth}},
school = {Linköping University},
type = {{LiTH-ISY-EX--23/5572--SE}},
year = {2023},
address = {Sweden},
}
In the field of autonomous driving a common scenario is to apply deep learning models on camera feeds to provide information about the surroundings. A recent trend is for such vision-based methods to be centralized, in that they fuse images from all cameras in one big model for a single comprehensive output. Designing and tuning such models is hard and time consuming, in both development and training. This thesis aims to reproduce the results of a paper about a centralized vision-based model performing 3D object detection, called BEVDet. Additional goals are to ablate the technique of class balanced grouping and sampling used in the model, to tune the model to improve generalization, and to change the detection head of the model to a Transformer decoder-based head.
The findings include a successful reproduction of the results of the paper, while adding depth supervision to BEVDet establishes a baseline for the subsequent experiments. An increasing validation loss during most of the training indicates that there is room for improvement in the generalization of the model. Several different methods are tested in order to resolve the increasing validation loss, but they all fail to do so. The ablation study shows that the class balanced grouping is important for the performance of the chosen configuration of the model, while the class balanced sampling does not contribute significantly. Without extensive tuning, the replacement head gives performance similar to the PETR, the model that the head is adapted from, but fails to match the performance of the baseline model. In addition, the model with the Transformer decoder-based head shows a converging validation loss, unlike the baseline model.
@mastersthesis{diva2:1771747,
author = {Lidman, Erik},
title = {{Visual Bird's-Eye View Object Detection for Autonomous Driving}},
school = {Linköping University},
type = {{LiTH-ISY-EX--23/5579--SE}},
year = {2023},
address = {Sweden},
}
In the digital age, where video content is abundant, this thesis investigates the efficient adaptation of an existing video-language model (VLM) to new data. The research leverages CLIP, a robust language-vision model, for various video-related tasks including video retrieval. The study explores using pre-trained VLMs to extract video embeddings without the need for extensive retraining. The effectiveness of a smaller model using aggregation is compared with larger models, and the application of logistic regression for few-shot learning on video embeddings is examined. Aggregation was done both without learning, through mean-pooling, and by utilizing a transformer. The video-retrieval models were evaluated on the ActivityNet Captions dataset, which contains long videos with dense descriptions, while the linear probes were evaluated on ActivityNet200, a video classification dataset.
The study's findings suggest that most models improved when additional frames were employed through aggregation. A model trained with fewer frames was able to surpass those trained with two or four times more frames by instead using aggregation. The incorporation of patch dropout and the freezing of embeddings proved advantageous by enhancing performance and conserving training resources. Furthermore, using a linear probe showed that the extracted features were of high quality, requiring only 2-4 samples per class to match the zero-shot performance.
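The non-learned aggregation and the few-shot linear probe are compact enough to sketch end to end; random vectors stand in for precomputed CLIP frame embeddings.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
frame_embs = [rng.normal(size=(16, 512)) for _ in range(64)]  # stand-in CLIP features
labels = rng.integers(0, 4, size=64)                          # one class per video

def video_embedding(frames):
    v = frames.mean(axis=0)                 # mean-pooling over frames
    return v / np.linalg.norm(v)

X = np.stack([video_embedding(f) for f in frame_embs])

# Few-shot linear probe on top of frozen embeddings
probe = LogisticRegression(max_iter=1000).fit(X[:32], labels[:32])
print(probe.score(X[32:], labels[32:]))
```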
@mastersthesis{diva2:1772807,
author = {Lindgren, Felix},
title = {{Efficient Utilization of Video Embeddings from Video-Language Models}},
school = {Linköping University},
type = {{LiTH-ISY-EX--23/5592--SE}},
year = {2023},
address = {Sweden},
}
The goal of this thesis is to use fringe-pattern phase analysis to calibrate the distortion of a camera lens. The benefit of this method is that the distortion can be calculated using data from each individual pixel, and the methodology does not require a parametric lens model.
The phase used to calibrate the images is calculated in two different ways: either utilizing the monogenic signal or through fringe-pattern phase analysis.
The calibration approaches were also validated through different methods, primarily by utilizing the Hough transform and by calibrating simulated distortion. The thesis also introduces a validation approach utilizing the phase orientation calculated through the monogenic signal.
The thesis also implements different approaches, such as flat field correction, to limit the impact of image sensor noise and thereby mitigate phase noise.
It is also investigated, through comparative analysis, which fringe-pattern frequencies are best suited for calibration. The analysis identified problems with both too high and too low fringe-pattern frequencies when calibrating using fringe-pattern phase analysis.
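For orientation, the classical Fourier route to fringe-pattern phase isolates one carrier lobe in the spectrum and takes the angle of its inverse transform; a sketch with assumed DC and lobe radii, which may differ from the thesis's pipeline.

```python
import numpy as np

def fringe_phase(img, dc_radius=10, lobe_radius=20):
    """Wrapped phase of a fringe pattern via Fourier (Takeda-style) analysis."""
    H, W = img.shape
    F = np.fft.fftshift(np.fft.fft2(img))
    ys, xs = np.ogrid[:H, :W]
    mag = np.abs(F)
    mag[(ys - H // 2) ** 2 + (xs - W // 2) ** 2 <= dc_radius ** 2] = 0  # suppress DC
    cy, cx = np.unravel_index(np.argmax(mag), mag.shape)  # pick one carrier lobe
    mask = (ys - cy) ** 2 + (xs - cx) ** 2 <= lobe_radius ** 2
    side = np.fft.ifft2(np.fft.ifftshift(F * mask))
    return np.angle(side)                  # wrapped phase; unwrap before use
```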
@mastersthesis{diva2:1773375,
author = {Karlsson, Karl},
title = {{Camera Distortion Calibration through Fringe Pattern Phase Analysis}},
school = {Linköping University},
type = {{LiTH-ISY-EX--23/5580--SE}},
year = {2023},
address = {Sweden},
}
Automatic 3D reconstruction of birds can aid researchers in studying their behavior. Recently there has been an attempt to reconstruct a variety of birds from single-view images. However, the common murre's appearance is different from that of the birds that have been studied, and recent studies have focused on side views. This thesis studies the 3D reconstruction of the common murre from single-view top-view images. A template mesh is first optimized to fit a 3D scan. The result is then used to optimize a species-specific mean from side-view images annotated with keypoints and silhouettes. The resulting mean mesh is used to initialize the optimization for top-down images. Using a mask loss, a pose prior loss, and a bone length loss that uses a mean vector from the side-view images improves the 3D reconstruction as rated by humans. Furthermore, the intersection over union (IoU) and percentage of correct keypoints (PCK), although used by other authors, are insufficient in a single-view top-view setting.
@mastersthesis{diva2:1779743,
author = {Hägerlind, Johannes},
title = {{3D-Reconstruction of the Common Murre}},
school = {Linköping University},
type = {{LiTH-ISY-EX--23/5576--SE}},
year = {2023},
address = {Sweden},
}
This thesis explores the integration of deep learning-based depth estimation models with the ORB-SLAM3 framework to address challenges in monocular Simultaneous Localization and Mapping (SLAM), particularly focusing on pure rotational movements. The study investigates the viability of using pre-trained generic depth estimation networks, and hybrid combinations of these networks, to replace traditional depth sensors and improve scale accuracy in SLAM systems. A series of experiments is conducted outdoors, utilizing a custom camera setup designed to isolate pure rotational movements. The analysis involves assessing each model's impact on the SLAM process as well as key performance indicators (KPIs) for both depth estimation and 3D tracking. Results indicate a correlation between depth estimation accuracy and SLAM performance, underscoring the potential of depth estimation models in enhancing SLAM systems. The findings contribute to the understanding of the role of monocular depth estimation in SLAM integration, especially in applications requiring precise spatial awareness for augmented reality.
@mastersthesis{diva2:1845865,
author = {Bladh, Daniel},
title = {{Deep Learning-Based Depth Estimation Models with Monocular SLAM:
Impacts of Pure Rotational Movements on Scale Drift and Robustness}},
school = {Linköping University},
type = {{LiTH-ISY-EX--23/5630--SE}},
year = {2023},
address = {Sweden},
}
Being able to train machine learning models on simulated data can be of great interest in several applications, one of them being autonomous driving of cars. The reason is that it is easier to collect large labeled datasets, as well as to perform reinforcement learning, in simulations. However, transferring these learned models to the real-world environment can be hard due to differences between the simulation and reality; for example, differences in material, textures, lighting and content. One approach is to use domain adaptation, making the simulations as similar as possible to reality. The thesis's main focus is to investigate domain adaptation as a way to meet the reality-gap, and also to compare it to an alternative method, domain randomization.
Two different methods of domain adaptation, one adapting the simulated data to reality and the other adapting the test data to simulation, are compared to using domain randomization. These are evaluated with a classifier making decisions for a robot car while driving in reality. The evaluation consists of a quantitative evaluation on real-world data and a qualitative evaluation aiming to observe how well the robot drives and avoids obstacles. The results show that the reality-gap is very large and that the examined methods reduce it, with the two using domain adaptation resulting in the largest decrease. However, none of them led to satisfactory driving.
@mastersthesis{diva2:1624770,
author = {Forsberg, Fanny},
title = {{Domain Adaptation to Meet the Reality-Gap from Simulation to Reality}},
school = {Linköping University},
type = {{LiTH-ISY-EX--21/5453--SE}},
year = {2022},
address = {Sweden},
}
When a camera system in a car is mounted behind the windshield, light rays will be refracted by the windshield. The distortion can be significant, especially for wide field-of-view cameras. Traditional approaches handle the windshield distortion along with the calibration that calculates the intrinsic and extrinsic parameters. However, these approaches do not handle the windshield distortion explicitly, and understanding the image formation requires understanding more about the windshield distortion effect. In this thesis, data is collected from a camera system viewing the scene with and without the windshield. The windshield distortion effect has been studied by varying the windshield's tilt and the camera's setup. Points are then found in both images and matched. From this, a distortion difference is calculated and analyzed. Next, a preliminary model of the windshield distortion effect is presented and evaluated. The results show that the model works well for all cases and both windshields considered in this thesis.
@mastersthesis{diva2:1638117,
author = {Luong, Therese},
title = {{Windshield Distortion Modelling}},
school = {Linköping University},
type = {{LiTH-ISY-EX--22/5455--SE}},
year = {2022},
address = {Sweden},
}
Deep learning has shown to be successful on the task of semantic segmentation of three-dimensional (3D) point clouds, which has many interesting use cases in areas such as autonomous driving and defense applications. A common type of sensor used for collecting 3D point cloud data is Light Detection and Ranging (LiDAR) sensors. In this thesis, a time-correlated single-photon counting (TCSPC) LiDAR is used, which produces very accurate measurements over long distances up to several kilometers. The dataset collected by the TCSPC LiDAR used in the thesis contains two classes, person and other, and it comes with several challenges due to it being limited in terms of size and variation, as well as being extremely class imbalanced. The thesis aims to identify, analyze, and evaluate state-of-the-art deep learning models for semantic segmentation of point clouds produced by the TCSPC sensor. This is achieved by investigating different loss functions, data variations, and data augmentation techniques for a selected state-of-the-art deep learning architecture. The results showed that loss functions tailored for extremely imbalanced datasets performed the best with regard to the metric mean intersection over union (mIoU). Furthermore, an improvement in mIoU could be observed when some combinations of data augmentation techniques were employed. In general, the performance of the models varied heavily, with some achieving promising results and others achieving much worse results.
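One widely used loss tailored for extreme class imbalance is the focal loss, which down-weights easy points; a PyTorch sketch for per-point classification (the abstract does not name its exact losses, so treat this as representative).

```python
import torch
import torch.nn.functional as F

def focal_loss(logits, targets, gamma=2.0, alpha=None):
    """logits: (N, C) per-point class scores; targets: (N,) class indices."""
    logp = F.log_softmax(logits, dim=-1)
    logp_t = logp.gather(1, targets.unsqueeze(1)).squeeze(1)  # log prob of true class
    p_t = logp_t.exp()
    loss = -((1 - p_t) ** gamma) * logp_t                     # down-weight easy points
    if alpha is not None:                                     # optional class weights
        loss = alpha[targets] * loss
    return loss.mean()
```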
@mastersthesis{diva2:1667072,
author = {Süsskind, Caspian},
title = {{Deep Learning Semantic Segmentation of 3D Point Cloud Data from a Photon Counting LiDAR}},
school = {Linköping University},
type = {{LiTH-ISY-EX--22/5467--SE}},
year = {2022},
address = {Sweden},
}
Radiologists often have to look through many different patients and examinations in quick succession, and to aid in the workflow the different types of images should be presented to the radiologist in the same manner and order for each new examination, thus decreasing the time needed for the radiologist to either find the correct image or rearrange the images to their liking. A step in this process requires a comparison between two images to be made, producing a score between 0 and 1 describing how similar the images are. A similar algorithm already exists at Sectra, but that algorithm only uses the metadata from the images without considering the actual pixel data.
The aim of this thesis was to explore different methods of doing the same comparison as the previous algorithm but using only the pixel data. Considering only 3D volumes from CT examinations of the abdomen and thorax region, this thesis explores the possibility of using SSIM, SIFT, and SIFT together with a histogram comparison using the Bhattacharyya distance for this task. It was deemed very important that the ranking produced when ordering the images in terms of similarity to one reference image followed a specific order. This order was determined by consulting personnel at Sectra who work closely with the clinical side of radiology.
SSIM was able to differentiate between different plane orientations, since they usually had large resolution differences in each direction, but it could not be made to follow the desired ranking and was thus disregarded as a reliable option for this problem. The method using SIFT followed the desired ranking better, but struggled considerably with differentiating between the different contrast phases. A histogram component was also added to this method, which increased the accuracy and improved the ranking. However, further development is still needed for this method to be a reliable option that could be used in a clinical setting.
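A toy combination of the two signals discussed above, SSIM and a Bhattacharyya histogram comparison, both mapped to [0, 1]; the weighting and bin count are assumptions, not the thesis's tuned values.

```python
import cv2
import numpy as np
from skimage.metrics import structural_similarity

def similarity_score(a, b, w=0.5):
    """a, b: single-channel uint8 slices; returns a score in [0, 1]."""
    a_r = cv2.resize(a, (b.shape[1], b.shape[0]))
    ssim = structural_similarity(a_r, b, data_range=255)
    ha = cv2.calcHist([a], [0], None, [64], [0, 256]); cv2.normalize(ha, ha)
    hb = cv2.calcHist([b], [0], None, [64], [0, 256]); cv2.normalize(hb, hb)
    bhatta = cv2.compareHist(ha, hb, cv2.HISTCMP_BHATTACHARYYA)  # 0 = identical
    return w * max(ssim, 0.0) + (1 - w) * (1.0 - bhatta)
```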
@mastersthesis{diva2:1665838,
author = {Castenbrandt, Felicia},
title = {{Image Similarity Scoring for Medical Images in 3D}},
school = {Linköping University},
type = {{LiTH-ISY-EX--22/5484--SE}},
year = {2022},
address = {Sweden},
}
Lens distortions appear in almost all digital images and cause straight lines to appear curved in the image. This can contribute to errors in position estimation and 3D reconstruction, and it is therefore of interest to correct for the distortion. If the camera is available, the distortion parameters can be obtained when calibrating the camera. However, when the camera is unavailable, the distortion parameters cannot be found with the standard camera calibration technique and other approaches must be used. Recently, variants of Perspective-n-Point (PnP) extended with lens distortion and focal length parameters have been proposed. Given a set of 2D-3D point correspondences, the PnP-based methods can estimate distortion parameters without the camera being available or with modified settings. In this thesis, the performance of PnP-based methods is compared to Zhang's camera calibration method. The methods are compared both quantitatively, using the errors in reprojection and distortion parameters, and qualitatively, by comparing images before and after lens distortion correction. A test set for the comparison was obtained from a camera and a 3D laser scanner of an indoor scene. The results indicate that one of the PnP-based models can achieve a reprojection error similar to the baseline method for one of the cameras. It could also be seen that two PnP-based models could reduce lens distortion when visually comparing the test images to the baseline. Moreover, it was noted that a model can have a small reprojection error even though the distortion coefficient error is large and the lens distortion is not completely removed. This indicates that it is important to include both quantitative measures, such as reprojection error and distortion coefficient errors, and qualitative results when comparing lens distortion correction methods. It could also be seen that PnP-based models with more parameters in the estimation are more sensitive to noise.
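For the quantitative comparison, the reprojection error can be computed with OpenCV's standard PnP given distortion coefficients; note that, unlike the PnP variants studied in the thesis, plain cv2.solvePnP takes the distortion as input rather than estimating it.

```python
import cv2
import numpy as np

def reprojection_error(obj_pts, img_pts, K, dist_coeffs):
    """obj_pts: (N, 3) float32 scanner points; img_pts: (N, 2) float32 pixels."""
    ok, rvec, tvec = cv2.solvePnP(obj_pts, img_pts, K, dist_coeffs)
    proj, _ = cv2.projectPoints(obj_pts, rvec, tvec, K, dist_coeffs)
    return np.linalg.norm(proj.reshape(-1, 2) - img_pts, axis=1).mean()
```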
@mastersthesis{diva2:1670770,
author = {Olsson, Emily},
title = {{Lens Distortion Correction Without Camera Access}},
school = {Linköping University},
type = {{LiTH-ISY-EX--22/5476--SE}},
year = {2022},
address = {Sweden},
}
An autonomous vehicle is a complex system that requires a good perception of the surrounding environment to operate safely. One part of that is multiple object tracking, which is an essential component in camera-based perception whose responsibility is to estimate object motion from a sequence of images. This requires an association problem to be solved where newly estimated object positions are mapped to previously predicted trajectories, for which different solution strategies exist.
In this work, a multiple hypothesis tracking algorithm is implemented. The purpose is to demonstrate that measurement associations are improved compared to less compute-intensive alternatives. It was shown that the implemented algorithm performed 13 percent better than an intersection over union tracker when evaluated using a standard evaluation metric.
Furthermore, this work also investigates the usage of abstraction layers to accelerate time-critical parallel operations on the GPU. It was found that the execution time of the tracking algorithm could be reduced by 42 percent by replacing four functions with implementations written in the purely functional array language Futhark. Finally, it was shown that a GPU code abstraction layer can reduce the knowledge barrier required to write efficient CUDA kernels.
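The less compute-intensive baseline, an IoU tracker's association step, reduces to an optimal one-to-one assignment over an IoU cost matrix; a sketch (the implemented MHT instead keeps multiple association hypotheses alive).

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

def iou(a, b):
    """Boxes as (x1, y1, x2, y2)."""
    x1, y1 = max(a[0], b[0]), max(a[1], b[1])
    x2, y2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    union = ((a[2] - a[0]) * (a[3] - a[1])
             + (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return inter / union if union > 0 else 0.0

def associate(tracks, detections, min_iou=0.3):
    cost = np.array([[1 - iou(t, d) for d in detections] for t in tracks])
    rows, cols = linear_sum_assignment(cost)      # optimal one-to-one matching
    return [(r, c) for r, c in zip(rows, cols) if cost[r, c] <= 1 - min_iou]
```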
@mastersthesis{diva2:1670800,
author = {Nolkrantz, Marcus},
title = {{Efficient multiple hypothesis tracking using a purely functional array language}},
school = {Linköping University},
type = {{LiTH-ISY-EX--22/5482--SE}},
year = {2022},
address = {Sweden},
}
With the increasing demand for labeled data in machine learning for visual perception tasks, the interest in using synthetically generated data has grown. Due to the existence of a domain gap between synthetic and real data, strategies in domain adaptation are necessary to achieve high performance with models trained on synthetic or mixed data.
With a dataset of synthetically blocked fish-eye lens images from traffic environments, we explore different strategies to train a neural network. The neural network is a binary classifier for full blockage detection. The different strategies tested are data mixing, fine-tuning, domain adversarial training, and adversarial discriminative domain adaptation. Different ratios between synthetically generated data and real data are also tested. Our experiments showed that fine-tuning had slightly superior results in this test environment. To fully take advantage of domain adversarial training, it is necessary to train until domain-indiscriminate features are learned; this helps the model attain higher performance than random data mixing.
@mastersthesis{diva2:1671549,
author = {Tran, Hoang},
title = {{Learning with Synthetically Blocked Images for Sensor Blockage Detection}},
school = {Linköping University},
type = {{LiTH-ISY-EX--22/5509--SE}},
year = {2022},
address = {Sweden},
}
Ceramic materials contain several defects, one of which is porosity. At the time of writing, porosity measurement is a manual and time-consuming process performed by a human operator. With advances in deep learning for computer vision, this thesis explores to what degree convolutional neural networks and semantic segmentation can reliably measure porosity from microscope images. Combining classical image processing techniques with deep learning, images were automatically labeled and then used for training semantic segmentation neural networks leveraging transfer learning. Deep learning-based methods were more robust and could more reliably identify porosity in a larger variety of images than solely relying on classical image processing techniques.
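Once a segmentation network has labeled the pore pixels, the porosity measurement itself is an area fraction; a minimal sketch assuming a single pore class id.

```python
import numpy as np

def porosity(segmentation, pore_class=1):
    """Porosity as the fraction of pixels classified as pores (class id assumed)."""
    pores = segmentation == pore_class
    return pores.sum() / pores.size
```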
@mastersthesis{diva2:1674176,
author = {Isaksson, Filip},
title = {{Measuring Porosity in Ceramic Coating using Convolutional Neural Networks and Semantic Segmentation}},
school = {Linköping University},
type = {{LiTH-ISY-EX--22/5490--SE}},
year = {2022},
address = {Sweden},
}
Estimation of forest parameters using remote sensing information could streamline the forest industry from a time and economic perspective. This thesis utilizes object detection and semantic segmentation to detect and classify individual trees in images of 3D models reconstructed from satellite images. The thesis investigated two methods that showed different strengths in detecting and classifying trees in deciduous, evergreen, or mixed forests. These methods are not only valuable for forest inventory but can also be greatly useful for telecommunication companies and in defense and intelligence applications. The thesis also presents methods for estimating tree volume and tree growth in 3D models. The results show the potential of the methods to be used in forest management. Finally, the thesis shows several benefits of managing a digitalized forest: economic, environmental, and social.
@mastersthesis{diva2:1673885,
author = {Dahm\'{e}n, Gustav and Strand, Erica},
title = {{Forest Growth And Volume Estimation Using Machine Learning}},
school = {Linköping University},
type = {{LiTH-ISY-EX--22/5508--SE}},
year = {2022},
address = {Sweden},
}
Unmanned aerial vehicles (UAVs) with high-resolution cameras are common in today's society. Industries, such as the forestry industry, use drones to get a fast overview of tree populations. More advanced sensors, such as near-infrared or depth sensors, can increase the amount of information that UAV images provide about the forest, such as tree quantity or forest health. However, the fast-expanding field of deep learning could help expand the information acquired using only RGB cameras. Three deep learning models, Faster R-CNN, RetinaNet, and YOLOR, were compared to investigate this. It was also investigated whether initializing the models using transfer learning from the MS COCO dataset could increase their performance. The datasets used were Swedish Forest Agency (2021): Forest Damages - Spruce Bark Beetle 1.0 National Forest Data Lab and drone images provided by IT-Bolaget Per & Per. The deep learning models were to detect five different tree species: spruce, pine, birch, aspen, and others. The results show potential for the use of deep learning to detect tree species in images from UAVs.
@mastersthesis{diva2:1676909,
author = {Sievers, Olle},
title = {{CNN-Based Methods for Tree Species Detection in UAV Images}},
school = {Linköping University},
type = {{LiTH-ISY-EX--22/5502--SE}},
year = {2022},
address = {Sweden},
}
Object tracking can be done in numerous ways, where the goal is to track a target through all frames in a sequence. The ground truth bounding box is used to initialize the object tracking algorithm. Object tracking can be carried out on infrared imagery suitable for military applications to execute tracking even without illumination. Objects, such as aircraft, can deploy countermeasures to impede tracking. The countermeasures most often mainly impact one wavelength band. Therefore, using two different wavelength bands for object tracking can counteract the impact of the countermeasures. The dataset was created from simulations. The countermeasures applied to the dataset are flares and Directional Infrared Countermeasures (DIRCMs).
Different object tracking algorithms exist, and many are based on discriminative correlation filters (DCF). The thesis investigated the DCF-based trackers STRCF and ECO on the created dataset. The STRCF and the ECO trackers were analyzed using one and two wavelength bands. The following features were investigated for both trackers: grayscale, Histogram of Oriented Gradients (HOG), and pre-trained deep features.
The results indicated that the STRCF and the ECO trackers using two wavelength bands instead of one improved performance on sequences with countermeasures. The use of HOG, deep features, or a combination of both improved the performance of the STRCF tracker using two wavelength bands. Likewise, the performance of the ECO tracker using two wavelength bands was improved by the use of deep features. However, the negative aspect of using two wavelength bands and introducing more features is that it resulted in a lower frame rate.
@mastersthesis{diva2:1676100,
author = {Modorato, Sara},
title = {{Tracking Under Countermeasures Using Infrared Imagery}},
school = {Linköping University},
type = {{LiTH-ISY-EX--22/5473--SE}},
year = {2022},
address = {Sweden},
}
In recent years, the EU has observed a decrease in the stocks of certain fish species due to unrestricted fishing. To combat the problem, many fisheries are investigating how to automatically estimate the catch size and composition using sensors onboard the vessels. Yet, measuring the size of fish in marine imagery is a difficult task. The images generally suffer from complex conditions caused by cluttered fish, motion blur and dirty sensors.
In this thesis, we propose a novel method for automatic measurement of fish size that can enable measuring both visible and occluded fish. We use a Mask R-CNN to segment the visible regions of the fish, and then fill in the shape of the occluded fish using a U-Net. We train the U-Net to perform shape completion in a semi-supervised manner, by simulating occlusions on an open-source fish dataset. In contrast to previous shape completion work, we teach the U-Net when to fill in the shape and when not to, by including a small portion of fully visible fish in the input training data.
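The occlusion simulation can be sketched as erasing part of a fully visible fish mask and training the U-Net to recover the original; the rectangular occluders below are a stand-in, since the abstract does not specify the occluder shapes.

```python
import numpy as np

def simulate_occlusion(mask, rng, max_frac=0.4):
    """mask: (H, W) binary fish mask. Returns the occluded network input;
    the unmodified mask serves as the training target."""
    h, w = mask.shape
    oh = rng.integers(1, int(h * max_frac))
    ow = rng.integers(1, int(w * max_frac))
    y, x = rng.integers(0, h - oh), rng.integers(0, w - ow)
    occluded = mask.copy()
    occluded[y:y + oh, x:x + ow] = 0
    return occluded
```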
Our results show that our proposed method succeeds in filling in the shape of the synthetically occluded fish, as well as of some of the cluttered fish in real marine imagery. We achieve an mIoU score of 93.9 % on 1 000 synthetic test images and present qualitative results on real images captured onboard a fishing vessel. The qualitative results show that the U-Net can fill in the shapes of lightly occluded fish, but struggles when the tail fin is hidden and only parts of the fish body are visible. This task is difficult even for a human, and the performance could perhaps be increased by including the fish appearance in the shape completion task. The simulation-to-reality gap could perhaps also be reduced by fine-tuning the U-Net on some real occlusions, which could increase the performance on the heavy occlusions in the real marine imagery.
@mastersthesis{diva2:1677704,
author = {Gustafsson, Stina},
title = {{Learning to Measure Invisible Fish}},
school = {Linköping University},
type = {{LiTH-ISY-EX--22/5517--SE}},
year = {2022},
address = {Sweden},
}
The development of autonomous driving systems has been one of the most popular research areas in the 21st century. One key component of these kinds of systems is the ability to perceive and comprehend the physical world. Two techniques that address this are object detection and semantic segmentation. During the last decade, CNN-based models have dominated these types of tasks. However, in 2021, transformer-based networks were able to outperform the existing CNN approach, indicating a paradigm shift in the domain. This thesis aims to explore the use of a vision transformer, particularly a Swin Transformer, in an object detection and semantic segmentation framework, and compare it to a classical CNN on road scenes. In addition, since real-time execution is crucial for autonomous driving systems, the possibility of a parameter reduction of the transformer-based network is investigated. The results appear to be advantageous for the Swin Transformer compared to the convolutional network, considering both object detection and semantic segmentation. Furthermore, the analysis indicates that it is possible to reduce the computational complexity while retaining the performance.
@mastersthesis{diva2:1678704,
author = {Hardebro, Mikaela and Jirskog, Elin},
title = {{Transformer Based Object Detection and Semantic Segmentation for Autonomous Driving}},
school = {Linköping University},
type = {{LiTH-ISY-EX--22/5487--SE}},
year = {2022},
address = {Sweden},
}
In recent years, pictures from handheld devices such as smartphones have been increasingly utilized as a documentation tool by medical practitioners not trained to take professional photographs. As with other image modalities, the images should be taken in a way that captures the vital information in the region of interest. Nevertheless, image capturing cannot always be done as desired, so images may exhibit different blur types in the region of interest. Blurry images do not serve medical purposes; therefore, patients might have to schedule a second appointment several days later to retake the images. A solution to this problem is to create an algorithm which, immediately after an image is captured, determines whether it is medically useful and notifies the user of the result. The algorithm needs to perform the analysis at a reasonable speed and, at best, with a limited number of operations so that the calculations can be made directly on the smartphone. A large number of medical images must be available to create such an algorithm. Medical images are difficult to acquire, and it is specifically difficult to acquire blurry images since they are usually deleted.
The main objective of this thesis is to determine the medical usefulness of images taken with smartphone cameras, using both machine learning and hand-crafted algorithms, with a low number of floating point operations and a high performance. Seven different algorithms (one hand-crafted and six machine learned) are created and compared regarding both the number of floating point operations and performance. Fast Walsh-Hadamard transforms are the basis of the hand-crafted algorithm. The employed machine learning algorithms are based both on common convolutional neural networks (MobileNetV3 and ResNet50) and on our own designs. The issue of the low number of acquired medical images is solved by training the machine learning models on a synthetic dataset, where the non-medically useful images are generated by applying blur to the medically useful images. These models do, however, undergo evaluation on a real dataset, containing both medically useful and non-medically useful images.
Our results indicate that real-time determination of the medical usefulness of images is possible on handheld devices, since our machine learned model DeepLAD-Net reaches the highest accuracy with 42 · 10^6 floating point operations. In terms of accuracy, MobileNetV3-large is the second best model, with 31 times as many floating point operations as our best model.
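A hedged sketch of a Walsh-Hadamard sharpness proxy: transform a square patch whose side is a power of two, order coefficients by sequency, and measure how much energy sits in the low-sequency corner. This is a simplified stand-in, not the thesis's hand-crafted algorithm.

```python
import numpy as np
from scipy.linalg import hadamard

def blur_score(patch):
    """patch: (n, n) float array, n a power of two. Higher score -> likely blurred."""
    n = patch.shape[0]
    H = hadamard(n)
    seq = (np.diff(H, axis=1) != 0).sum(axis=1)   # sign changes per row = sequency
    Hs = H[np.argsort(seq)]                       # sequency-ordered Walsh matrix
    W = Hs @ patch @ Hs.T / n                     # 2D Walsh-Hadamard transform
    E = W ** 2
    k = n // 4
    return E[:k, :k].sum() / E.sum()              # energy share of low sequencies
```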
@mastersthesis{diva2:1670428,
author = {Zahra, Hasseli and Raamen, Anwia Odisho},
title = {{Automatic Quality Assessment of Dermatology Images:
A Comparison Between Machine Learning and Hand-Crafted Algorithms}},
school = {Linköping University},
type = {{LiTH-ISY-EX--22/5486--SE}},
year = {2022},
address = {Sweden},
}
With advancements in space technology, remote sensing applications, and computer vision, significant improvements in the data describing our planet are seen today. Researchers want to gather different kinds of data and perform data fusion techniques between them to increase our understanding of the world. Two such data types are Electro-Optical images and Synthetic Aperture Radar images. For data fusion, the images need to be accurately aligned. Researchers have investigated methods for robustly and accurately registering these images for many years. However, recent advancements in imaging systems have made the problem more complex than ever.
Currently, the imaging satellites that capture information around the globe have achieved a resolution of less than a meter per pixel. There is an increase in signal complexity for high-resolution SAR images due to how the imaging system operates. Interference between waves gives rise to speckle noise and geometric distortions, making the images very difficult to interpret. This directly affects the image registration accuracy.
In this thesis, the complexity of the registration problem between SAR and EO data was described, and methods for registering the images were investigated. The methods were feature- and area-based. The feature-based method used a KAZE filter and SURF descriptor. The method found many key points but few correct correspondences. The area-based methods used FFT and MI, respectively. FFT was deemed best for higher quality images, whereas MI dealt better with the non-linear intensity difference. More complex techniques, such as dense neural networks, were excluded. No method achieved satisfying results on the entire data set, but the area-based methods accomplished complementary results.
A conclusion was drawn that the distortions in the SAR images are too significant to register accurately using only CV algorithms. Since the area-based methods achieved good results on images excluding significant distortions, future work should focus on solving the geometrical errors and increasing the registration accuracy.
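The FFT-based area method is essentially phase correlation, which scikit-image provides directly; a sketch with a synthetically shifted image standing in for a resampled SAR/EO pair.

```python
import numpy as np
from skimage.registration import phase_cross_correlation

rng = np.random.default_rng(0)
eo = rng.normal(size=(256, 256))
sar = np.roll(eo, shift=(5, -3), axis=(0, 1))    # toy stand-in for the SAR image

# Estimated (row, col) translation registering `sar` onto `eo`
shift, error, diffphase = phase_cross_correlation(eo, sar, upsample_factor=10)
aligned = np.roll(sar, shift=tuple(np.round(shift).astype(int)), axis=(0, 1))
```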
@mastersthesis{diva2:1682316,
author = {Hansson, Niclas},
title = {{Investigation of Registration Methods for High Resolution SAR-EO Imagery}},
school = {Linköping University},
type = {{LiTH-ISY-EX--22/5506--SE}},
year = {2022},
address = {Sweden},
}
This master thesis project was done together with Saab Dynamics in Linköping in the spring of 2022 and aims to perform an online IMU-camera calibration using an AprilTag board. Experiments are conducted on two different types of datasets: the public dataset Euroc and internal datasets from Saab. The calibration is done iteratively by solving a series of nonlinear optimization problems without any initial knowledge of the sensor configuration. The method is largely based on work by Huang and collaborators. Besides finding the transformation between the IMU and the camera, the biases in the IMU and the time delay between the two sensors are also explored. By comparing the resulting transformation with Kalibr, the current state-of-the-art offline calibration toolbox, it is possible to conclude that the model can find and correct for the biases in the gyroscope. It is therefore important to include these biases in the model. The model is able to roughly find the time shift between the two sensors but has more difficulty correcting for it. The thesis also aims to explore ways of compiling a good dataset for calibration. Results show that it is desirable to avoid rapid movements, as well as images gathered at distances from the AprilTag board that vary a lot. Also, a shorter exposure time is useful to avoid losing AprilTag detections.
@mastersthesis{diva2:1701458,
author = {Karlhede, Arvid},
title = {{Online Camera-IMU Calibration}},
school = {Linköping University},
type = {{LiTH-ISY-EX--22/5524--SE}},
year = {2022},
address = {Sweden},
}
Automatic detection of weeds could be used for more efficient weed control in agriculture. In this master thesis, weed detectors have been trained and examined on data collected by RISE to investigate whether an accurate weed detector could be trained on the collected data. When only using annotations of the weed class Creeping thistle for training and evaluation, a detector achieved a mAP of 0.33. When using four classes of weed, a detector was trained with a mAP of 0.07. The performance was worse than in a previous study also dealing with weed detection. Hypotheses for why the performance was lacking were examined. Experiments indicated that the problem could not fully be explained by the model being underfitted, nor by the objects' backgrounds being too similar to the foreground, nor by the quality of the annotations being too low. The performance was better when training the model with as much data as possible than when only selected segments of the data were used.
@mastersthesis{diva2:1666845,
author = {Ahlqvist, Axel},
title = {{Examining Difficulties in Weed Detection}},
school = {Linköping University},
type = {{}},
year = {2022},
address = {Sweden},
}
Perception of depth, ego-motion and robust keypoints is critical for SLAM and structure from motion applications. Neural networks have achieved great performance in perception tasks in recent years, but collecting labeled data for supervised training is labor intensive and costly. This thesis explores recent methods in unsupervised training of neural networks that can predict depth, ego-motion and keypoints, and do geometric consensus maximization. The benefit of unsupervised training is that the networks can learn from raw data collected from the camera sensor, instead of labeled data. The thesis focuses on training on images from a monocular camera, where no stereo or LIDAR data is available. The experiments compare different techniques for depth and ego-motion prediction from previous research, and show how the techniques can be combined successfully. A keypoint prediction network is evaluated and its performance is compared with the ORB detector provided by OpenCV. A geometric consensus network is also implemented and its performance is compared with the RANSAC algorithm in OpenCV. The consensus maximization network is trained on the output of the keypoint prediction network. For future work it is suggested that all networks could be combined and trained jointly to reach a better overall performance. The results show (1) which techniques in unsupervised depth prediction are most effective, (2) that the keypoint predicting network outperformed the ORB detector, and (3) that the consensus maximization network was able to classify outliers with comparable performance to the RANSAC algorithm of OpenCV.
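Most of the compared unsupervised depth and ego-motion techniques share one mechanism: differentiably warping a source frame into the target view using predicted depth and pose, then penalizing the photometric error. A condensed sketch of that mechanism, not any specific paper's full loss:

```python
import torch
import torch.nn.functional as F

def inverse_warp(src, depth, T, K):
    """src: (B,3,H,W) source frame; depth: (B,1,H,W) target-view depth;
    T: (B,4,4) target->source transform; K: (B,3,3) intrinsics."""
    B, _, H, W = src.shape
    ys, xs = torch.meshgrid(torch.arange(H), torch.arange(W), indexing="ij")
    pix = torch.stack([xs, ys, torch.ones_like(xs)], 0).float()   # (3,H,W)
    pix = pix.view(1, 3, -1).expand(B, 3, H * W)
    cam = torch.inverse(K) @ pix * depth.view(B, 1, -1)           # back-project
    cam = torch.cat([cam, torch.ones(B, 1, H * W)], 1)            # homogeneous
    p = K @ (T @ cam)[:, :3]                                      # into source view
    p = p[:, :2] / p[:, 2:].clamp(min=1e-6)
    gx = 2 * p[:, 0] / (W - 1) - 1                                # normalize to [-1,1]
    gy = 2 * p[:, 1] / (H - 1) - 1
    grid = torch.stack([gx, gy], -1).view(B, H, W, 2)
    return F.grid_sample(src, grid, align_corners=True)

def photometric_loss(target, src, depth, T, K):
    # L1 only; published methods typically add an SSIM term and masking
    return (target - inverse_warp(src, depth, T, K)).abs().mean()
```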
@mastersthesis{diva2:1534180,
author = {Örjehag, Erik},
title = {{Unsupervised Learning for Structure from Motion}},
school = {Linköping University},
type = {{LiTH-ISY-EX--21/5361--SE}},
year = {2021},
address = {Sweden},
}
Training data is an essential ingredient within supervised learning, yet time consuming, expensive and for some applications impossible to retrieve. Thus it is of interest to use synthetic training data. However, the domain shift of synthetic data makes it challenging to obtain good results when used as training data for deep learning models. It is therefore of interest to refine synthetic data, e.g. using image-to-image translation, to improve results. The aim of this work is to compare different methods for image-to-image translation of synthetic training data of thermal IR-images using GANs. Translation is done both using synthetic thermal IR-images alone, as well as including pixelwise depth and/or semantic information. For evaluation, a new measure based on the Fréchet Inception Distance, adapted to work for thermal IR-images, is proposed. The results show that the model trained using IR-images alone translates the generated images closest to the domain of authentic thermal IR-images. The training where IR-images are complemented by corresponding pixelwise depth data performs second best. However, given more training time, inclusion of depth data has the potential to outperform training with IR data alone. This gives a valuable insight on how to best translate images from the domain of synthetic IR-images to that of authentic IR-images, which is vital for quick and low cost generation of training data for deep learning models.
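The Fréchet distance at the core of the proposed measure has a closed form between two Gaussian fits; the sketch below computes it from two sets of feature vectors, leaving out the thesis-specific IR feature extractor:

    import numpy as np
    from scipy import linalg

    def frechet_distance(feats_a, feats_b):
        """Fréchet distance between Gaussians fitted to two feature sets
        (rows are samples): d^2 = ||mu_a - mu_b||^2
                                  + Tr(Ca + Cb - 2 (Ca Cb)^(1/2))."""
        mu_a, mu_b = feats_a.mean(0), feats_b.mean(0)
        cov_a = np.cov(feats_a, rowvar=False)
        cov_b = np.cov(feats_b, rowvar=False)
        covmean = linalg.sqrtm(cov_a @ cov_b).real  # drop tiny imaginary parts
        diff = mu_a - mu_b
        return diff @ diff + np.trace(cov_a + cov_b - 2.0 * covmean)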
@mastersthesis{diva2:1543340,
author = {Hamrell, Hanna},
title = {{Image-to-Image Translation for Improvement of Synthetic Thermal Infrared Training Data Using Generative Adversarial Networks}},
school = {Linköping University},
type = {{LiTH-ISY-EX--21/5364--SE}},
year = {2021},
address = {Sweden},
}
Instance segmentation has a great potential for improving the current state of littering by autonomously detecting and segmenting different categories of litter. With this information, litter could, for example, be geotagged to aid litter pickers or to give precise locational information to unmanned vehicles for autonomous litter collection. Land-based litter instance segmentation is a relatively unexplored field, and this study aims to give a comparison of the instance segmentation models Mask R-CNN and DetectoRS using the multiclass litter dataset called Trash Annotations in Context (TACO) in conjunction with the Common Objects in Context precision and recall scores. TACO is an imbalanced dataset, and imbalanced data-handling is therefore addressed, using a second-order relation iterative stratified split, and additionally oversampling when training Mask R-CNN. Mask R-CNN without oversampling resulted in a segmentation mAP of 0.127, and with oversampling 0.163. DetectoRS achieved a segmentation mAP of 0.167, and improves the segmentation mAP of small objects most noticeably, by a factor of at least 2, which is important within the litter domain since small objects such as cigarettes are overrepresented. In contrast, oversampling with Mask R-CNN does not seem to improve the general precision of small and medium objects, but only improves the detection of large objects. It is concluded that DetectoRS improves results compared to Mask R-CNN, as does oversampling. However, using a dataset that cannot have an all-class representation for train, validation, and test splits, together with an iterative stratification that does not guarantee all-class representations, makes it hard for future works to do exact comparisons to this study. Results are therefore approximate when considering all categories, since 12 categories are missing from the test set, 4 of which were impossible to split into train, validation, and test sets. Further image collection and annotation to mitigate the imbalance would most noticeably improve results, since results depend on class-averaged values. Oversampling with DetectoRS would also help improve results. There is also the option to combine the two datasets TACO and MJU-Waste to enforce training of more categories.
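The abstract does not spell out the oversampling scheme, so as an illustration the sketch below uses LVIS-style repeat-factor sampling, a common recipe for long-tailed instance segmentation; the threshold value and names are assumptions:

    import numpy as np

    def repeat_factors(image_labels, num_classes, threshold=0.1):
        """LVIS-style repeat-factor sampling: images containing rare classes
        are repeated more often. image_labels is a list of per-image class
        index lists; threshold is the target class frequency."""
        n = len(image_labels)
        class_freq = np.zeros(num_classes)
        for labels in image_labels:
            for c in set(labels):
                class_freq[c] += 1.0 / n
        # Per-class repeat factor: max(1, sqrt(t / f_c)).
        r_class = np.maximum(1.0, np.sqrt(threshold / np.maximum(class_freq, 1e-12)))
        # Per-image factor: max over the classes present in the image.
        return [max(r_class[c] for c in set(labels)) if labels else 1.0
                for labels in image_labels]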
@mastersthesis{diva2:1546705,
author = {Sievert, Rolf},
title = {{Instance Segmentation of Multiclass Litter and Imbalanced Dataset Handling:
A Deep Learning Model Comparison}},
school = {Linköping University},
type = {{LiTH-ISY-EX--21/5365--SE}},
year = {2021},
address = {Sweden},
}
As photos and videos are increasingly used as evidence material, it is important to know whether they can be relied upon, or whether there is a real risk that they have been forged. This thesis investigates methods for detecting anomalous regions in images and videos using photo-response non-uniformity -- a fixed-pattern sensor noise that can be estimated from photos or videos.
For photos, experiments were performed on a method that assumes other photos from the same camera are available. For videos, experiments were performed on a method further developed from the still image method, with other videos from the same camera being available. The last experiments were performed on videos where only the video under investigation was available.
The experiments on the still image method were performed on images with three different kinds of forged regions: a forged region from somewhere else in the same photo, a forged region from a photo taken by another camera, and a forged region from the same sensor position in a photo taken by the same camera. The method should not be able to detect the third kind of forged region. Experiments performed on videos had a forged region in several adjacent frames in the video. The forged region was from another video, and it moved and changed shape between the frames.
The methods mainly consist of a classification process and some post-processing. In the classification process, features were extracted from the images/videos and used in a random forest classifier. The results are presented in terms of precision, recall, F1 score and false positive rate.
The quality of the still images was generally better than that of the videos, which also led to better results. For the cameras used in the experiments, it seemed easier to estimate a good PRNU pattern from photos and videos from older cameras, probably due to sensor differences and extra processing in newer camera models. How the images and videos are compressed also affects the possibility of estimating a good PRNU pattern, because important information may then be lost.
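For reference, the classical PRNU pipeline that this kind of work builds on (maximum-likelihood fingerprint estimation followed by block-wise correlation, cf. Lukas et al.) can be sketched as follows for grayscale images; the Gaussian denoiser is a stand-in for the wavelet filter normally used:

    import numpy as np
    import cv2

    def noise_residual(img):
        """Noise residual W = I - F(I); a Gaussian blur stands in for the
        usual wavelet-based denoising filter here."""
        denoised = cv2.GaussianBlur(img, (3, 3), 0)
        return img.astype(np.float64) - denoised.astype(np.float64)

    def estimate_prnu(images):
        """Maximum-likelihood PRNU estimate K = sum(W_i * I_i) / sum(I_i^2)
        over several images from the same camera."""
        num = np.zeros(images[0].shape, np.float64)
        den = np.zeros_like(num)
        for img in images:
            I = img.astype(np.float64)
            num += noise_residual(img) * I
            den += I * I
        return num / np.maximum(den, 1e-8)

    def block_correlation(img, prnu, block):
        """Normalized correlation between a block's residual and I*K;
        low correlation flags a potentially forged region."""
        y, x, h, w = block
        W = noise_residual(img)[y:y+h, x:x+w]
        S = (img.astype(np.float64) * prnu)[y:y+h, x:x+w]
        W, S = W - W.mean(), S - S.mean()
        return (W * S).sum() / (np.linalg.norm(W) * np.linalg.norm(S) + 1e-12)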
@mastersthesis{diva2:1552602,
author = {Söderqvist, Kerstin},
title = {{Anomaly Detection in Images and Videos Using Photo-Response Non-Uniformity}},
school = {Linköping University},
type = {{LiTH-ISY-EX--21/5367--SE}},
year = {2021},
address = {Sweden},
}
Reconstruction of sonar images is an inverse problem, which is normally solved with model-based methods. These methods may introduce undesired artifacts called angular and range leakage into the reconstruction. In this thesis, a method called Learned Primal-Dual Reconstruction, which combines a data-driven and a model-based approach, is used to investigate the use of data-driven methods for reconstruction within sonar imaging. The method uses primal and dual variables inspired by classical optimization methods, where parts are replaced by convolutional neural networks that iteratively find a solution to the reconstruction problem. Eight models with different architectures and training parameters are trained and validated on synthetic data. The models are evaluated on measurement data and the results are compared with those from a purely model-based method. Reconstructions performed on synthetic data, where a ground truth image is available, show that it is possible to achieve reconstructions with the data-driven method that have less leakage than reconstructions from the model-based method. For reconstructions performed on measurement data, where no ground truth is available, some variants of the learned model achieve a good result with less leakage.
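One unrolled iteration of a learned primal-dual scheme (in the spirit of Adler and Öktem's formulation, which this family of methods follows) might look as below in PyTorch; the channel counts and the forward_op/adjoint_op callables standing in for the sonar measurement operator and its adjoint are assumptions:

    import torch
    import torch.nn as nn

    class PrimalDualStep(nn.Module):
        """One unrolled iteration: small CNNs replace the proximal operators
        of a classical primal-dual scheme."""
        def __init__(self, channels=32):
            super().__init__()
            self.dual_net = nn.Sequential(
                nn.Conv2d(3, channels, 3, padding=1), nn.PReLU(),
                nn.Conv2d(channels, 1, 3, padding=1))
            self.primal_net = nn.Sequential(
                nn.Conv2d(2, channels, 3, padding=1), nn.PReLU(),
                nn.Conv2d(channels, 1, 3, padding=1))

        def forward(self, primal, dual, data, forward_op, adjoint_op):
            # Dual update: compare the current estimate against the data.
            dual = dual + self.dual_net(
                torch.cat([dual, forward_op(primal), data], dim=1))
            # Primal update: map the dual back to image space via the adjoint.
            primal = primal + self.primal_net(
                torch.cat([primal, adjoint_op(dual)], dim=1))
            return primal, dual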
@mastersthesis{diva2:1561999,
author = {Nilsson, Lovisa},
title = {{Data-Driven Methods for Sonar Imaging}},
school = {Linköping University},
type = {{LiTH-ISY-EX--21/5381--SE}},
year = {2021},
address = {Sweden},
}
This thesis investigates the development and use of software to measure respiratory frequency in cows using optronics and computer vision. It examines mainly two different strategies of image and signal processing and their performance for different input qualities. The effect of heat stress on dairy cows and the high transmission risk of pneumonia for calves make the investigation done during this thesis highly relevant, since they both have the same symptom: increased respiratory frequency. The data set used in this thesis consisted of recordings of dairy cows in different environments and from varying angles. Recordings where the authors could determine a true breathing frequency by monitoring body movements were accepted into the data set and used to test and develop the algorithms. One method developed in this thesis estimated the breathing rate in the frequency domain by the Fast Fourier Transform and was named "N-point Fast Fourier Transform". The other method was called "Breathing Movement Zero-Crossing Counting". It estimated a signal in the time domain, whose fundamental frequency was determined by a zero-crossing algorithm as the breathing frequency. The results showed that both of the developed algorithms successfully estimated a breathing frequency with a reasonable error margin for most of the data set. The zero-crossing algorithm showed the most consistent result, with an error margin lower than 0.92 breaths per minute (BPM) for twelve of thirteen recordings. However, it is limited to recordings where the camera is placed above the cow. The N-point FFT algorithm estimated the breathing frequency with error margins between 0.44 and 5.20 BPM for the same recordings as the zero-crossing algorithm. This method is not limited to a specific camera angle but requires the cow to be relatively stationary to get accurate results. It could therefore also be evaluated on the remaining three recordings of the data set, for which the error margins were measured between 1.92 and 10.88 BPM. Both methods had execution times acceptable for implementation in real time. The data set was, however, too incomplete to determine performance across recordings from different optronic devices.
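Once a 1-D breathing-motion signal has been extracted from the video, both estimators reduce to a few lines. A sketch, with hypothetical function names and fs denoting the sample rate in Hz:

    import numpy as np

    def fft_breathing_rate(signal, fs):
        """Dominant frequency of a breathing-motion signal via the FFT,
        returned in breaths per minute (BPM)."""
        sig = signal - np.mean(signal)
        spectrum = np.abs(np.fft.rfft(sig))
        freqs = np.fft.rfftfreq(len(sig), d=1.0 / fs)
        # Restrict to a plausible breathing band, e.g. 0.1-2 Hz (6-120 BPM).
        band = (freqs >= 0.1) & (freqs <= 2.0)
        return freqs[band][np.argmax(spectrum[band])] * 60.0

    def zero_crossing_rate(signal, fs):
        """Breathing rate from zero crossings: one full breath cycle gives
        two crossings of the mean-centred signal."""
        sig = signal - np.mean(signal)
        crossings = np.sum(np.signbit(sig[:-1]) != np.signbit(sig[1:]))
        duration_min = len(sig) / fs / 60.0
        return crossings / 2.0 / duration_min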
@mastersthesis{diva2:1563490,
author = {Antonsson, Per and Johansson, Jesper},
title = {{Measuring Respiratory Frequency Using Optronics and Computer Vision}},
school = {Linköping University},
type = {{LiTH-ISY-EX--21/5376--SE}},
year = {2021},
address = {Sweden},
}
Generic visual object tracking is the task of tracking one or several objects in all frames of a video, knowing only the location and size of the target in the initial frame. Visual tracking can be carried out in both the infrared and the visual spectrum simultaneously; this is known as multi-modal tracking. Utilizing both spectra can result in a more diverse tracker, since visual tracking in infrared imagery makes it possible to detect objects even in poor visibility or in complete darkness. However, infrared imagery lacks the level of detail present in visual images. A common method for visual tracking is to use discriminative correlation filters (DCF). These correlation filters are then used to detect an object in every frame of an image sequence. This thesis focuses on investigating aspects of a DCF-based tracker operating in the two different modalities, infrared and visual imagery. First, it was investigated whether the tracking benefits from using two channels instead of one, and what happens to the tracking result if one of those channels is degraded by an external cause. It was also investigated if the addition of image features can further improve the tracking. The results show that the tracking improves when using two channels instead of only a single channel. They also show that utilizing two channels is a good way to create a robust tracker, which is still able to perform even though one of the channels is degraded. Deep features, extracted from a pre-trained convolutional neural network, were the image feature improving the tracking the most, although the implementation of the deep features made the tracking significantly slower.
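The simplest member of the DCF family, the MOSSE filter, captures the core idea; for a multi-channel (infrared plus visual) tracker the numerator and denominator sums would additionally run over channels. A numpy sketch:

    import numpy as np

    def train_mosse(patches, target, lam=1e-2):
        """Closed-form DCF: H* = sum(G * conj(F_i)) / (sum(F_i * conj(F_i)) + lam),
        where F_i are FFTs of training patches and G is the FFT of the desired
        (typically Gaussian) response centred on the target."""
        G = np.fft.fft2(target)
        num = np.zeros_like(G)
        den = np.zeros_like(G)
        for p in patches:
            F = np.fft.fft2(p)
            num += G * np.conj(F)
            den += F * np.conj(F)
        return num / (den + lam)

    def detect(H, patch):
        """Correlation response for a new patch; its peak gives the shift."""
        response = np.real(np.fft.ifft2(H * np.fft.fft2(patch)))
        return np.unravel_index(np.argmax(response), response.shape)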
@mastersthesis{diva2:1566492,
author = {Wettermark, Emma and Berglund, Linda},
title = {{Multi-Modal Visual Tracking Using Infrared Imagery}},
school = {Linköping University},
type = {{LiTH-ISY-EX--21/5401--SE}},
year = {2021},
address = {Sweden},
}
The increasing popularity of drones has made it convenient to capture a large number of images of a property, which can then be used to build a 3D model. The conditions of buildings can be analyzed to plan renovations. This creates an interest for automatically identifying building materials, a task well suited for machine learning.
With access to drone imagery of buildings as well as depth maps and normal maps, we created a dataset for semantic segmentation. Two different convolutional neural networks were trained and evaluated, to see how well they perform material segmentation. DeepLabv3+, which uses RGB data, was compared to Depth-Aware CNN, which uses RGB-D data. Our experiments showed that DeepLabv3+ achieved higher mean intersection over union.
To investigate if the information in the depth maps and normal maps could give a performance boost, we conducted experiments with an encoding we call HMN: horizontal disparity, magnitude of normal with ground, normal parallel with gravity. This three-channel encoding was used to jointly train two CNNs, one with RGB and one with HMN, and then sum their predictions. This led to improved results for both DeepLabv3+ and Depth-Aware CNN.
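The described fusion amounts to running two networks and summing their per-pixel scores; a minimal PyTorch sketch, where make_net stands in for whichever segmentation backbone is used and summing logits (rather than probabilities) is an assumption:

    import torch.nn as nn

    class LateFusionSegmenter(nn.Module):
        """Two parallel segmentation networks, one fed RGB and one fed the
        three-channel HMN encoding; their per-pixel class scores are summed."""
        def __init__(self, make_net, num_classes):
            super().__init__()
            self.rgb_net = make_net(num_classes)
            self.hmn_net = make_net(num_classes)

        def forward(self, rgb, hmn):
            # Both branches return NxCxHxW logits for C classes.
            return self.rgb_net(rgb) + self.hmn_net(hmn)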
@mastersthesis{diva2:1567671,
author = {Rydgård, Jonas and Bejgrowicz, Marcus},
title = {{Semantic Segmentation of Building Materials in Real World Images Using 3D Information}},
school = {Linköping University},
type = {{LiTH-ISY-EX--21/5405--SE}},
year = {2021},
address = {Sweden},
}
Learning-based multi-view stereo (MVS) has shown promising results in the domain of general 3D reconstruction. However, no work before this thesis has applied learning-based MVS to urban 3D reconstruction from satellite images. In this thesis, learning-based MVS is used to infer depth maps from satellite images. Models are trained on both synthetic and real satellite images from Las Vegas with ground truth data from a high-resolution aerial-based 3D model. This thesis also evaluates different methods for reconstructing digital surface models (DSM) and compares them to existing satellite-based 3D models at Maxar Technologies. The DSMs are created by either post-processing point clouds obtained from predicted depth maps or by an end-to-end approach where the depth map for an orthographic satellite image is predicted.
This thesis concludes that learning-based MVS can be used to predict accurate depth maps. Models trained on synthetic data yielded relatively good results, but not nearly as good as models trained on real satellite images. The trained models also generalize relatively well to cities not present in training. This thesis also concludes that the reconstructed DSMs achieve better quantitative results than the existing 3D model in Las Vegas, and similar results for the test sets from other cities. Compared to ground truth, the best-performing method achieved L1 and L2 errors 14 % and 29 % lower, respectively, than Maxar's current 3D model. The method that uses a point cloud as an intermediate step achieves better quantitative results than the end-to-end system. Very promising qualitative results are achieved with the proposed methods, especially when utilizing an end-to-end approach.
@mastersthesis{diva2:1567722,
author = {Yngesjö, Tim},
title = {{3D Reconstruction from Satellite Imagery Using Deep Learning}},
school = {Linköping University},
type = {{}},
year = {2021},
address = {Sweden},
}
Detecting and outlining products in images is beneficial for many use cases in e-commerce, such as automatically identifying and locating products within images and proposing matches for the detections. This study investigated how the utilisation of metadata associated with images of products could help boost the performance of an existing approach, with the ultimate goal of reducing the manual labour needed to annotate images. This thesis explored if approximate pseudo masks could be generated for products in images by leveraging metadata as image-level labels and subsequently using the masks to train a Mask R-CNN. However, this approach did not yield satisfactory results. Further, this study found that by incorporating the metadata directly in the Mask R-CNN, an mAP performance increase of nearly 5% was achieved. Furthermore, utilising the available metadata to divide the training samples for a KNN model into subsets resulted in an increased top-3 accuracy of up to 16%. By representing the data with embeddings created by a pre-trained CNN, the KNN model performed better, with both higher accuracy and more reasonable suggestions.
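The metadata-subset KNN idea can be sketched with scikit-learn; treating the subset key as a product category from the metadata is an assumption here:

    import numpy as np
    from sklearn.neighbors import KNeighborsClassifier

    def fit_knn_per_subset(embeddings, labels, subset_keys, k=3):
        """Train one KNN per metadata-defined subset so that a query is only
        matched against plausible candidates. embeddings: NxD array,
        labels: length-N array, subset_keys: length-N metadata values."""
        embeddings, labels = np.asarray(embeddings), np.asarray(labels)
        models = {}
        for key in set(subset_keys):
            idx = [i for i, s in enumerate(subset_keys) if s == key]
            knn = KNeighborsClassifier(n_neighbors=k)
            knn.fit(embeddings[idx], labels[idx])
            models[key] = knn
        return models

    # At query time, route the embedding to the KNN for its metadata subset:
    # probs = models[query_key].predict_proba(query_embedding[None, :])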
@mastersthesis{diva2:1570488,
author = {Wahlquist, Gustav},
title = {{Improving Automatic Image Annotation Using Metadata}},
school = {Linköping University},
type = {{LiTH-ISY-EX--21/5398--SE}},
year = {2021},
address = {Sweden},
}
Image segmentation through neural networks and deep learning has, in the recent decade, become a successful tool for automated decision-making. For Luossavaara-Kiirunavaara Aktiebolag (LKAB), this means identifying the amount of slag inside a furnace through computer vision.
There are many prominent convolutional neural network architectures in the literature, and this thesis explores two: a modified U-Net and the PSPNet. The architectures were combined with three loss functions and three class weighting schemes, resulting in 18 model configurations that were evaluated and compared. This thesis also explores transfer learning techniques for neural networks tasked with identifying slag in images from inside a furnace. The benefit of transfer learning is that the network can learn to find features from already labeled data of another context. Finally, the thesis explores how temporal information can be utilised by adding an LSTM layer to a model taking pairs of images as input instead of one.
The results show (1) that the PSPNet outperformed the U-Net for all tested configurations in all relevant metrics, (2) that the model is able to find more complex features while converging quicker by using transfer learning, and (3) that utilising temporal information reduced the variance of the predictions, and that the modified PSPNet using an LSTM layer showed promise in handling images with outlying characteristics.
@mastersthesis{diva2:1572304,
author = {von Koch, Christian and Anz\'{e}n, William},
title = {{Detecting Slag Formation with Deep Learning Methods:
An experimental study of different deep learning image segmentation models}},
school = {Linköping University},
type = {{LiTH-ISY-EX--21/5427--SE}},
year = {2021},
address = {Sweden},
}
This master thesis studies the learning of dense feature descriptors where camera poses are the only supervisory signal. The use of camera poses as a supervisory signal has only been published once before, and this thesis expands on this previous work by utilizing a couple of different techniques meant to increase the robustness of the method, which is particularly important when not having access to ground-truth correspondences. Firstly, an adaptive robust loss is utilized to better differentiate inliers and outliers. Secondly, statistical properties during training are both enforced and adapted to, in an attempt to alleviate problems with uncertainties introduced by not having true correspondences available. These additions are shown to slightly increase performance, and also highlight some key ideas related to prediction certainty and robustness when working with camera poses as a supervisory signal. Finally, possible directions for future work are discussed.
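The abstract does not name the loss, but a common choice of adaptive robust loss is Barron's general robust kernel, whose shape parameter can itself be optimized during training. A sketch (valid away from the special cases alpha = 0 and alpha = 2):

    import torch

    def general_robust_loss(x, alpha, c):
        """Barron's general robust loss: alpha interpolates between L2-like
        (alpha near 2), Charbonnier (alpha = 1) and heavier-tailed kernels
        that downweight outliers; c sets the scale of the inlier region."""
        b = torch.abs(torch.as_tensor(alpha - 2.0))
        return (b / alpha) * (((x / c) ** 2 / b + 1.0) ** (alpha / 2.0) - 1.0)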
@mastersthesis{diva2:1573398,
author = {Dahlqvist, Marcus},
title = {{Adaptive Losses for Camera Pose Supervision}},
school = {Linköping University},
type = {{LiTH-ISY-EX--21/5422--SE}},
year = {2021},
address = {Sweden},
}
This thesis investigates the possibility of utilizing data from multiple modalities to enable an automated recycling system to separate ferrous from non-ferrous debris. The two methods, sensor fusion and hallucinogenic sensor fusion, were implemented in a four-step approach of deep CNNs. Sensor fusion implies that multiple modalities are run simultaneously during the operation of the system. The individual outputs are then fused, and the joint performance is expected to be superior to having only one of the sensors. In hallucinogenic sensor fusion, the goal is to achieve the benefits of sensor fusion with respect to cost and complexity even when one of the modalities is removed from the system. This is achieved by leveraging data from a more complex modality onto a simpler one in a student/teacher approach. As a result, the teacher modality will train the student sensor to hallucinate features beyond its visual spectrum. Based on the results of a prestudy involving multiple types of modalities, a hyperspectral sensor was deployed as the teacher to complement a simple RGB camera. Three studies involving differently composed datasets were then conducted to evaluate the effectiveness of the methods. The results show that the joint performance of a hyperspectral sensor and an RGB camera is superior to either sensor on its own. It can also be concluded that training a network with hyperspectral images can improve the classification accuracy when operating with only RGB data. However, the addition of a hyperspectral sensor might be considered superfluous, as this report shows that the standardized shapes of industrial debris enable a single RGB camera to achieve an accuracy above 90%. The material used in this thesis can also be concluded to be suboptimal for hyperspectral analysis: compared to vegetation scenes, only a limited amount of additional data could be obtained by including wavelengths besides the ones representing red, green and blue.
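The student/teacher ("hallucination") objective is typically the ordinary task loss plus a feature-matching term against the frozen teacher; the PyTorch sketch below assumes an MSE matching term, which is one common choice:

    import torch.nn.functional as F

    def hallucination_loss(student_feat, teacher_feat, student_logits, labels,
                           distill_weight=1.0):
        """The RGB student is supervised both by the class labels and by
        matching the features of the (frozen) hyperspectral teacher, so that
        at test time RGB alone 'hallucinates' spectral cues."""
        task = F.cross_entropy(student_logits, labels)
        distill = F.mse_loss(student_feat, teacher_feat.detach())
        return task + distill_weight * distill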
@mastersthesis{diva2:1582328,
author = {Brundin, Sebastian and Gräns, Adam},
title = {{Efficient Recycling Of Non-Ferrous Materials Using Cross-Modal Knowledge Distillation}},
school = {Linköping University},
type = {{LiTH-ISY-EX--21/5403--SE}},
year = {2021},
address = {Sweden},
}
This thesis provides a comparison between instance segmentation methods using point clouds and depth images. Specifically, their performance on cluttered scenes of irregular objects in an industrial environment is investigated.
Recent work by Wang et al. [1] has suggested potential benefits of a point cloud representation when performing deep learning on data from 3D cameras. However, little work has been done to enable quantifiable comparisons between methods based on different representations, particularly on industrial data.
Generating synthetic data provides accurate grayscale, depth map, and point cloud representations for a large number of scenes and can thus be used to compare methods regardless of datatype. The datasets in this work are created using a tool provided by SICK. They simulate postal packages on a conveyor belt scanned by a LiDAR, closely resembling a common industry application. Two datasets are generated. One dataset has low complexity, containing only boxes. The other has higher complexity, containing a combination of boxes and multiple types of irregularly shaped parcels.
State-of-the-art instance segmentation methods are selected based on their performance on existing benchmarks. We chose PointGroup by Jiang et al. [2], which uses point clouds, and Mask R-CNN by He et al. [3], which uses images.
The results support that there may be benefits of using a point cloud representation over depth images. PointGroup performs better in terms of the chosen metric on both datasets. On low complexity scenes, the inference times are similar between the two methods tested. However, on higher complexity scenes, Mask R-CNN is significantly faster.
@mastersthesis{diva2:1584003,
author = {Konradsson, Albin and Bohman, Gustav},
title = {{3D Instance Segmentation of Cluttered Scenes:
A Comparative Study of 3D Data Representations}},
school = {Linköping University},
type = {{LiTH-ISY-EX--21/5421--SE}},
year = {2021},
address = {Sweden},
}
Deep learning methods for medical image segmentation are hindered by the lack of training data. This thesis aims to develop a method that overcomes this problem. A basic U-net trained on XCAT phantom data was tested first. The segmentation results were unsatisfactory even when artificial quantum noise was added. As a workaround, CycleGAN was used to add tissue textures to the XCAT phantom images by analyzing patient CT images. The generated images were used to train the network. The textures introduced by CycleGAN improved the segmentation, but some errors remained. The basic U-net was replaced with Attention U-net, which further improved the segmentation. More work is needed to fine-tune and thoroughly evaluate the method. The results obtained so far demonstrate the potential of this method for the segmentation of medical images. The proposed algorithms may be used in iterative image reconstruction algorithms in multi-energy computed tomography.
@mastersthesis{diva2:1584712,
author = {Zhao, Hang},
title = {{Segmentation and synthesis of pelvic region CT images via neural networks trained on XCAT phantom data}},
school = {Linköping University},
type = {{}},
year = {2021},
address = {Sweden},
}
When radiologists examine X-rays, it is crucial that they are aware of the laterality of the examined body part. The laterality refers to which side of the body is considered, e.g. Left and Right. The consequences of a mistake based on incorrect laterality information could be disastrous. This thesis aims to address this problem by providing a deep neural network model that classifies X-rays based on their laterality.
X-ray images contain markers that are used to indicate the laterality of the image. In this thesis, both a classification model and a detection model have been trained to detect these markers and to identify the laterality. The models have been trained and evaluated on four body parts: knees, feet, hands and shoulders. The images can be divided into three laterality classes: Bilateral, Left and Right.
The model proposed in this thesis is a combination of two classification models: one for distinguishing between Bilateral and Unilateral images, and one for classifying Unilateral images as Left or Right. The latter utilizes the confidence of the predictions to categorize some of them as less accurate (Uncertain), which includes images where the marker is not visible or very hard to identify.
The model was able to correctly distinguish Bilateral from Unilateral with an accuracy of 100.0 %. For the Unilateral images, 5.00 % were categorized as Uncertain and for the remaining images, 99.99 % of those were classified correctly as Left or Right.
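The decision logic described above fits in a few lines; the 0.9 confidence threshold below is a placeholder, not the thesis's value:

    def classify_laterality(p_bilateral, p_left, threshold=0.9):
        """Two-stage decision: first Bilateral vs Unilateral, then Left vs
        Right, with low-confidence unilateral cases routed to 'Uncertain'.
        p_bilateral and p_left are softmax probabilities from the two models."""
        if p_bilateral >= 0.5:
            return "Bilateral"
        confidence = max(p_left, 1.0 - p_left)
        if confidence < threshold:
            return "Uncertain"  # marker not visible or very hard to identify
        return "Left" if p_left >= 0.5 else "Right"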
@mastersthesis{diva2:1587188,
author = {Björn, Martin},
title = {{Laterality Classification of X-Ray Images:
Using Deep Learning}},
school = {Linköping University},
type = {{LiTH-ISY-EX--21/5417--SE}},
year = {2021},
address = {Sweden},
}
In the glass wool industry, the molten glass flow is monitored for regulation purposes. Given the progress in the computer vision field, the current monitoring solution might be replaced by a camera based solution. The aim of this thesis is to investigate the possibility of using optical flow techniques for estimation of the molten glass flow displacement.
Three glass melt flow datasets were recorded, as well as two additional melt flow datasets, using a NIR camera. The block matching techniques Full Search (FS) and Adaptive Rood Pattern Search (ARPS), as well as the local feature methods ORB and A-KAZE were considered. These four techniques were compared to RAFT, the state-of-the-art approach for optical flow estimation, using available pre-trained models, as well as an approach of using the tracking method ECO for the optical flow estimation.
The methods have been evaluated using the metrics MAE, MSE, and SSIM to compare the warped flow to the target image. In addition, ground truth for 50 frames from each dataset was manually annotated in order to use the optical flow metric End-Point Error. To investigate the computational complexity, the average computational time per frame was calculated.
The investigation found that RAFT does not perform well on the given data, due to the large displacements of the flows. For simulated displacements of up to about 100 pixels at full resolution, the performance is satisfactory, with results comparable to the traditional methods.
Using ECO for optical flow estimation encounters similar problems as RAFT, where the large displacement proved challenging for the tracker. Simulating smaller motions of up to 60 pixels resulted in good performance, though computation time of the used implementation is much too high for a real-time implementation.
The four traditional block matching and local feature approaches examined in this thesis outperform the state-of-the-art approaches. FS, ARPS, A-KAZE, and ORB all have similar performance on the glass flow datasets, whereas the block matching approaches fail on the alternative melt flow data as the template extraction approach is inadequate. The two local feature approaches, though working reasonably well on all datasets given full resolution, struggle to identify features on down-sampled data. This might be mitigated by fine-tuning the settings of the methods. Generally, ORB mostly outperforms A-KAZE with respect to the evaluation metrics, and is considerably faster.
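Of the traditional methods, Full Search is the most direct to sketch: exhaustive block matching over a search window. The radius and the sum-of-absolute-differences criterion below are illustrative assumptions:

    import numpy as np

    def full_search(template, frame, center, radius):
        """Exhaustive block matching: slide the template over a window of
        +/- radius pixels around 'center' in the next frame and return the
        displacement minimizing the sum of absolute differences (SAD)."""
        th, tw = template.shape
        cy, cx = center
        best, best_dxdy = np.inf, (0, 0)
        for dy in range(-radius, radius + 1):
            for dx in range(-radius, radius + 1):
                y, x = cy + dy, cx + dx
                if y < 0 or x < 0:
                    continue  # outside the frame
                patch = frame[y:y + th, x:x + tw]
                if patch.shape != template.shape:
                    continue  # window extends past the border
                sad = np.abs(patch.astype(np.int32)
                             - template.astype(np.int32)).sum()
                if sad < best:
                    best, best_dxdy = sad, (dx, dy)
        return best_dxdy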
@mastersthesis{diva2:1592777,
author = {Rudin, Malin},
title = {{Evaluation of Optical Flow for Estimation of Liquid Glass Flow Velocity}},
school = {Linköping University},
type = {{LiTH-ISY-EX--21/5433--SE}},
year = {2021},
address = {Sweden},
}
Hyperspectral imaging is an expanding topic within the field of computer vision that uses images of high spectral granularity. Contrastive learning is a discriminative approach to self-supervised learning, a form of unsupervised learning where the network is trained using self-created pseudo-labels. This work combines these two research areas and investigates how a pretrained network based on contrastive learning can be used for hyperspectral images. The hyperspectral images used in this work are generated from simulated RGB images and spectra from a spectral library. The network is trained with a pretext task based on data augmentations, and is evaluated through transfer learning and fine-tuning for a downstream task. The goal is to determine the impact of the pretext task on the downstream task and to determine the required amount of labelled data. The results show that the downstream task (a classifier) based on the pretrained network barely performs better than a classifier without a pretrained network. In the end, more research needs to be done to confirm or reject the benefit of a pretrained network based on contrastive learning for hyperspectral images. Also, the pretrained network should be tested on real-world hyperspectral data and trained with a pretext task designed for hyperspectral images.
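Contrastive pretraining of this kind usually optimizes an InfoNCE-style objective over two augmented views of each image; the SimCLR-style NT-Xent loss below is an assumption, as the abstract does not name the exact loss:

    import torch
    import torch.nn.functional as F

    def nt_xent(z1, z2, temperature=0.5):
        """NT-Xent: two augmented views of the same image form positive
        pairs; all other samples in the batch act as negatives."""
        z = F.normalize(torch.cat([z1, z2], dim=0), dim=1)  # 2N x d
        sim = z @ z.t() / temperature                        # cosine similarities
        n = z1.shape[0]
        mask = torch.eye(2 * n, dtype=torch.bool, device=z.device)
        sim.masked_fill_(mask, float("-inf"))                # remove self-pairs
        # The positive of sample i is i+n (mod 2n).
        targets = torch.cat([torch.arange(n, 2 * n),
                             torch.arange(0, n)]).to(z.device)
        return F.cross_entropy(sim, targets)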
@mastersthesis{diva2:1593358,
author = {Syr\'{e}n Grönfelt, Natalie},
title = {{Pretraining a Neural Network for Hyperspectral Images Using Self-Supervised Contrastive Learning}},
school = {Linköping University},
type = {{LiTH-ISY-EX--21/5382--SE}},
year = {2021},
address = {Sweden},
}
In Storytel's application, in which a user can read and listen to digitalized literature, the user is shown a list of books where the first thing they encounter is the book title and cover. A book cover is therefore essential to attract a consumer's attention. In this study, we take a data-driven approach to investigating the design principles for book covers through deep learning models and explainable AI. The first aim is to explore how well a Convolutional Neural Network (CNN) can interpret and classify a book cover image according to its genre in a multi-class classification task. The second aim is to increase model interpretability and investigate correlations between model features and genres. With the help of the explanatory artificial intelligence method Gradient-weighted Class Activation Mapping (Grad-CAM), we analyze the pixel-wise contribution to the model prediction. In addition, object detection by YOLOv3 was implemented to investigate which objects are detectable and reoccurring in the book covers. An interplay between Grad-CAM and YOLOv3 was used to investigate how identified objects and features correlate with a specific book genre and ultimately answer what makes a good book cover. Using a state-of-the-art CNN model architecture, we achieve an accuracy of 48%, with the best class-wise accuracies for the genres Erotica, Economy & Business and Children, with accuracies of 73%, 67% and 66%. Quantitative results from the Grad-CAM and YOLOv3 interplay show some strong associations between objects and genres, while indicating weak associations between abstract design principles and genres. Furthermore, a qualitative analysis of Grad-CAM visualizations shows strong relevance of certain objects and text fonts for specific book genres. It was also observed that the portrayal of a feature was relevant for the model prediction of certain genres.
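Grad-CAM itself is compact enough to sketch. The hook-based version below assumes a reasonably recent PyTorch (for register_full_backward_hook) and a model that returns class logits:

    import torch

    def grad_cam(model, layer, image, class_idx):
        """Grad-CAM: weight the chosen layer's activations by the spatial
        mean of the class-score gradients, then ReLU the weighted sum."""
        acts, grads = [], []
        h1 = layer.register_forward_hook(lambda m, i, o: acts.append(o))
        h2 = layer.register_full_backward_hook(lambda m, gi, go: grads.append(go[0]))
        score = model(image)[0, class_idx]
        model.zero_grad()
        score.backward()
        h1.remove(); h2.remove()
        A, dA = acts[0], grads[0]                   # N x C x H x W
        weights = dA.mean(dim=(2, 3), keepdim=True) # per-channel importance
        cam = torch.relu((weights * A).sum(dim=1))  # N x H x W
        return cam / (cam.max() + 1e-8)             # normalize for display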
@mastersthesis{diva2:1576364,
author = {Velander, Alice and Gumpert Harrysson, David},
title = {{Do Judge a Book by its Cover!
Predicting the genre of book covers using supervised deep learning. Analyzing the model predictions using explanatory artificial intelligence methods and techniques.}},
school = {Linköping University},
type = {{}},
year = {2021},
address = {Sweden},
}
Hyperspectral imaging based on the use of an exponentially variable filter makes it possible to construct a lightweight hyperspectral sensor. The exponentially variable filter captures the whole spectral range in each image, where each column captures a different wavelength. Gathering the full spectrum for any given point in the image therefore requires the fusion of several images with movement in between captures. The construction of a hyperspectral cube requires registration of the gathered images. With a lightweight sensor comes the possibility to mount the hyperspectral sensor on an unmanned aerial vehicle to collect aerial footage. This thesis presents a registration algorithm capable of constructing a complete hyperspectral cube of almost any chosen area in the captured region. The thesis presents the results of a construction method using a multi-frame super-resolution algorithm to increase the spectral resolution, and a spline interpolation method interpolating missing spectral data. The result of an algorithm suggesting the optimal spectral and spatial resolution before constructing the hyperspectral cube is also presented, as is the result of an algorithm providing information about the quality of the constructed hyperspectral cube.
@mastersthesis{diva2:1596253,
author = {Freij, Hannes},
title = {{Hyperspectral Image Registration and Construction From Irregularly Sampled Data}},
school = {Linköping University},
type = {{LiTH-ISY-EX--21/5408--SE}},
year = {2021},
address = {Sweden},
}
Point set registration is a well-researched yet still not very exploited area in computer vision. As the field of machine learning grows, the possibilities of application expand. This thesis investigates the possibility of expanding an already implemented probabilistic machine learning approach to point set registration to more complex, larger datasets gathered in a forest environment. The system used as a starting point was created by Järemo Lawin et al. [10]. The aim of the thesis was to investigate the possibility of registering the forest data with the existing system, without ground-truth poses, with different optimizers, and to implement a SLAM pipeline. Older methods were also used as a benchmark for evaluation, more specifically iterative closest point (ICP) and fast global registration (FGR). To enable the gathered data to be processed by the registration algorithms, preprocessing was required, transforming the data points from the coordinate system of the sensor to world-relative coordinates via LiDAR base coordinates. Subsequently, the registration was performed with different approaches. Both the KITTI odometry dataset, which RLLReg originally was evaluated with [10], and the gathered forest data were used. Data augmentation was utilized to enable ground-truth-independent training and to increase diversity in the data. In addition, the registration results were used to create a SLAM pipeline, enabling mapping and localization in the scanned areas. The results showed great potential for using RLLReg to register forest scenes compared to other, older approaches. In particular, the lack of ground truth was manageable using data augmentation to create training data. Moreover, there was no evidence that AdaBound improves the system when replacing the Adam optimizer. Finally, forest models with sensor paths plotted were generated with decent results, although there is potential for further refinement through post-processing. Nevertheless, the possibility of point set registration and LiDAR-SLAM using machine learning has been confirmed.
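One of the classical baselines, point-to-point ICP, alternates nearest-neighbour association with a closed-form rigid fit; one iteration in numpy/scipy:

    import numpy as np
    from scipy.spatial import cKDTree

    def icp_step(source, target):
        """One iteration of point-to-point ICP: find nearest neighbours,
        then solve for the rigid transform with the Kabsch/SVD method.
        source and target are Nx3 and Mx3 arrays."""
        _, idx = cKDTree(target).query(source)   # closest target points
        matched = target[idx]
        mu_s, mu_t = source.mean(0), matched.mean(0)
        H = (source - mu_s).T @ (matched - mu_t)
        U, _, Vt = np.linalg.svd(H)
        R = Vt.T @ U.T
        if np.linalg.det(R) < 0:                 # avoid reflections
            Vt[-1] *= -1
            R = Vt.T @ U.T
        t = mu_t - R @ mu_s
        return source @ R.T + t, R, t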
@mastersthesis{diva2:1612438,
author = {Hjert, Anton},
title = {{Machine Learning for LiDAR-SLAM:
In Forest Terrains}},
school = {Linköping University},
type = {{}},
year = {2021},
address = {Sweden},
}
In this thesis, three well known self-supervised methods have been implemented and trained on road scene images. The three so-called pretext tasks RotNet, MoCov2, and DeepCluster were used to train a neural network self-supervised. The self-supervised trained networks were then evaluated with different amounts of labeled data on two downstream tasks, object detection and semantic segmentation. The performance of the self-supervised methods is compared to networks trained from scratch on the respective downstream task. The results show that it is possible to achieve a performance increase using self-supervision on a dataset containing road scene images only. When only a small amount of labeled data is available, the performance increase can be substantial, e.g., a mIoU from 33 to 39 when training semantic segmentation on 1750 images with a RotNet pre-trained backbone compared to training from scratch. However, it seems that when a large number of labeled images is available (>70000 images), the self-supervised pretraining does not increase the performance as much, or at all.
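Of the three pretext tasks, RotNet is the simplest to illustrate: the pseudo-labels are the rotations themselves. A sketch:

    import torch

    def rotnet_batch(images):
        """RotNet pretext task: rotate each image by 0/90/180/270 degrees
        and let the network predict which rotation was applied. 'images' is
        an NxCxHxW tensor; returns the rotated batch and pseudo-labels."""
        rotated, labels = [], []
        for k in range(4):
            rotated.append(torch.rot90(images, k, dims=(2, 3)))
            labels.append(torch.full((images.shape[0],), k, dtype=torch.long))
        return torch.cat(rotated), torch.cat(labels)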
@mastersthesis{diva2:1608285,
author = {Gustavsson, Simon},
title = {{Object Detection and Semantic Segmentation Using Self-Supervised Learning}},
school = {Linköping University},
type = {{LiTH-ISY-EX--21/5357--SE}},
year = {2021},
address = {Sweden},
}
This thesis investigates methods for automatic colour transfer when working with geodata and possible metrics to evaluate the results. Several methods for colour transfer as well as methods to create an objective measurement were tested. The methods were evaluated using a subjective score, which was generated by surveying eight people working with geodata. In the survey the participants were asked to "Rank the images from most similar to least similar, with what you imagine the result would have been if you would have made the colour transfer manually". The method with the best overall performance in this study was colour transfer in the CIEl colour space. This method was only matched by a method segmenting the image first based on colour information, as the latter had the highest average subjective score but a larger standard deviation than other methods. This was suspected to be largely due to the deviation in quality of the segmentation algorithm; using a different method for segmenting the image, this approach might perform even better. The objective measurements proposed in this study were not found to have a consistent correlation with the subjective measurement, with the exception of gradient structural similarity. Other methods could have a use in some cases, but not as a general colour transfer objective measurement, though a larger study and more data would be needed to confirm the findings.
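Colour transfer by matching channel statistics (Reinhard et al.'s approach) is a likely reading of the method family evaluated here; the sketch below operates in OpenCV's Lab space, which is an assumption since the exact colour space is only partly legible in the abstract:

    import cv2
    import numpy as np

    def colour_transfer(source, reference):
        """Match the per-channel mean and standard deviation of the source
        to the reference in the Lab colour space (inputs are uint8 BGR)."""
        src = cv2.cvtColor(source, cv2.COLOR_BGR2LAB).astype(np.float64)
        ref = cv2.cvtColor(reference, cv2.COLOR_BGR2LAB).astype(np.float64)
        for ch in range(3):
            s_mu, s_std = src[..., ch].mean(), src[..., ch].std()
            r_mu, r_std = ref[..., ch].mean(), ref[..., ch].std()
            src[..., ch] = (src[..., ch] - s_mu) * (r_std / (s_std + 1e-8)) + r_mu
        out = np.clip(src, 0, 255).astype(np.uint8)
        return cv2.cvtColor(out, cv2.COLOR_LAB2BGR)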
@mastersthesis{diva2:1601738,
author = {Ågren, Anton},
title = {{Automatic Colour Transfer for Geodata}},
school = {Linköping University},
type = {{LiTH-ISY-EX--21/5378--SE}},
year = {2021},
address = {Sweden},
}
Classifying clothing attributes in surveillance images can be useful in the forensic field, making it easier to, for example, find suspects based on eyewitness accounts. Deep Neural Networks are often used successfully in image classification, but require a large amount of annotated data. Since labeling data can be time consuming or difficult, and it is easier to get hold of labeled fashion images, this thesis investigates how the domain shift from a fashion domain to a surveillance domain, with little or no annotated data, affects a classifier.
In the experiments, two deep networks of different depth are used as a base and trained on only fashion images as well as both labeled and unlabeled surveillance images, with and without domain adaptation regularizers. The surveillance dataset is new and consists of images that were collected from different surveillance cameras and annotated during this thesis work.
The results show that there is a degradation in performance for a classifier trained on the fashion domain when tested on the surveillance domain, compared to when tested on the fashion domain. The results also show that if no labeled data in the surveillance domain is used for these experiments, it is more effective to use the deeper network and train it on only fashion data, rather than to use the more complicated unsupervised domain adaptation method.
@mastersthesis{diva2:1392992,
author = {Härnström, Denise},
title = {{Classification of Clothing Attributes Across Domains}},
school = {Linköping University},
type = {{LiTH-ISY-EX--20/5276--SE}},
year = {2020},
address = {Sweden},
}
The performance of conventional deep neural networks tends to degrade when a domain shift is introduced, such as collecting data from a new site. Model-Agnostic Meta-Learning, or MAML, has achieved state-of-the-art performance in few-shot learning by finding initial parameters that adapt easily to new tasks.
This thesis studies MAML in a digital pathology setting. Experiments show that a conventional model generalises poorly to data collected from another site. By annotating a few samples during inference however, a model with initial parameters obtained through MAML training can adapt to achieve better generalisation performance. It is also demonstrated that a simple transfer learning approach using a kNN classifier on features extracted from a conventional model yields good generalisation, but the variance caused by random sampling is higher.
The results indicate that meta learning can lead to a lower annotation effort for machine learning in digital pathology while maintaining accuracy.
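The MAML inner/outer structure can be sketched compactly; the first-order variant below is chosen for brevity (it drops the second-order terms of full MAML) and all names are illustrative:

    import copy
    import torch

    def fomaml_task_grads(model, loss_fn, support, query, inner_lr=0.01):
        """First-order MAML for one task: adapt a cloned model on the
        support set with one SGD step, evaluate it on the query set, and
        return the query gradients of the adapted weights. Applying these
        to the original (meta) parameters ignores second-order terms."""
        fast = copy.deepcopy(model)
        x_s, y_s = support
        loss = loss_fn(fast(x_s), y_s)
        grads = torch.autograd.grad(loss, fast.parameters())
        with torch.no_grad():
            for p, g in zip(fast.parameters(), grads):
                p -= inner_lr * g                 # inner adaptation step
        x_q, y_q = query
        q_loss = loss_fn(fast(x_q), y_q)
        return torch.autograd.grad(q_loss, fast.parameters())

    # Meta-update: average the returned gradients over a batch of tasks and
    # apply them to model.parameters() with an outer-loop optimizer.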
@mastersthesis{diva2:1414984,
author = {Fagerblom, Freja},
title = {{Model-Agnostic Meta-Learning for Digital Pathology}},
school = {Linköping University},
type = {{LiTH-ISY-EX--20/5284--SE}},
year = {2020},
address = {Sweden},
}
When creating a photorealistic 3D model of the world using satellite imagery, image classification is an important part of the process. In this thesis the specific part of automated building extraction is investigated, by comparing the performance of instance segmentation and semantic segmentation for extraction of building footprints in orthorectified imagery. Semantic segmentation of the images is solved by using U-net, a Fully Convolutional Network that outputs a pixel-wise segmentation of the image. Instance segmentation of the images is done by a network called Mask R-CNN. The performance of the models is measured using precision, recall and the F1 score, which is the harmonic mean between precision and recall. The resulting F1 scores of the two methods are similar, with U-net achieving an F1 score of 0.684 without any post-processing, and Mask R-CNN achieving an F1 score of 0.676 without post-processing.
@mastersthesis{diva2:1417200,
author = {Fritz, Karin},
title = {{Instance Segmentation of Buildings in Satellite Images}},
school = {Linköping University},
type = {{LiTH-ISY-EX--20/5283--SE}},
year = {2020},
address = {Sweden},
}
In today's society, we experience an increasing challenge to provide healthcare to everyone in need due to the increasing number of patients and the shortage of medical staff. Computers have contributed to mitigating this challenge by offloading some of the tasks from the medical staff. With the rise of deep learning, countless new possibilities have opened to help the medical staff even further. One domain where deep learning can be applied is the analysis of ultrasound images. In this thesis we investigate the problem of classifying standard views of the heart in ultrasound images with the help of deep learning. We conduct mainly three experiments. First, we use NASNet Mobile, InceptionV3, VGG16 and MobileNet, pre-trained on ImageNet, and finetune them to ultrasound heart images. We compare the accuracy of these networks to each other and to the baseline model, a CNN that was proposed in [23]. Then we assess a neural network's capability to generalize to images from ultrasound machines that the network is not trained on. Lastly, we test how the performance of the networks degrades with a decreasing amount of training data. Our first experiment shows that all networks considered in this study have very similar performance in terms of accuracy, with InceptionV3 being slightly better than the rest. The best performance is achieved when the whole network is finetuned to our problem instead of finetuning only a part of it while gradually unlocking more layers for training. The generalization experiment shows that neural networks have the potential to generalize to images from ultrasound machines that they are not trained on. It also shows that having a mix of multiple ultrasound machines in the training data increases generalization performance. In our last experiment we compare the performance of the CNN proposed in [23] with MobileNet pre-trained on ImageNet and MobileNet randomly initialized. This shows that the performance of the baseline model suffers the least with a decreasing amount of training data and that pre-training helps the performance drastically on smaller training datasets.
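Finetuning with gradual unfreezing, as described, takes only a few lines with torchvision (the weights argument assumes torchvision 0.13 or later); the class count is a placeholder:

    import torch.nn as nn
    from torchvision import models

    # Load MobileNet pre-trained on ImageNet and replace the classifier head.
    net = models.mobilenet_v2(weights=models.MobileNet_V2_Weights.IMAGENET1K_V1)
    num_views = 8  # hypothetical number of heart-view classes
    net.classifier[-1] = nn.Linear(net.last_channel, num_views)

    # Gradual unfreezing: start by training only the new head...
    for p in net.features.parameters():
        p.requires_grad = False
    # ...then unlock deeper blocks as training progresses, e.g.:
    for p in net.features[-3:].parameters():
        p.requires_grad = True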
@mastersthesis{diva2:1425635,
author = {Pop, David},
title = {{Classification of Heart Views in Ultrasound Images}},
school = {Linköping University},
type = {{LiTH-ISY-EX--20/5288--SE}},
year = {2020},
address = {Sweden},
}
Previously well aligned image sensors, mounted on the same camera, might become misaligned due to external vibrations. It is of interest to be able to automatically detect and correct for this misalignment, and to separate the deviation into pointing and/or parallax errors. Two methods were evaluated for this purpose: an area based image registration method and a feature based image registration method. In the area based method, normalized cross-correlation was used to estimate translation parameters. In the feature based method, SIFT or LIOP descriptors were used to extract features that were matched between the two image modalities to estimate transformation parameters. In both methods, only image points that were in focus were extracted to avoid detection of false alignment deviations. The results indicate that the area based image registration method has potential to automatically detect and correct for an alignment deviation. Moreover, the area based method showed potential to separate the deviation into pointing errors and parallax errors. The feature based method was limited to specific scenes but could be used as a complement to the area based method in order to additionally correct for rotation and/or scaling.
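The core step of the area based method, normalized cross-correlation, is available directly in OpenCV; a minimal sketch in which the focused patch rectangle is assumed given:

    import cv2

    def estimate_translation(ir_img, vis_img, patch_rect):
        """Estimate the translation between two modalities: a focused patch
        from one image is correlated over the other, and the correlation
        peak gives the displacement."""
        x, y, w, h = patch_rect
        template = ir_img[y:y + h, x:x + w]
        response = cv2.matchTemplate(vis_img, template, cv2.TM_CCOEFF_NORMED)
        _, score, _, peak = cv2.minMaxLoc(response)  # peak = (px, py)
        return (peak[0] - x, peak[1] - y), score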
@mastersthesis{diva2:1434095,
author = {Bjerwe, Ida},
title = {{Automatic Alignment Detection and Correction in Infrared and Visual Image Pairs}},
school = {Linköping University},
type = {{LiTH-ISY-EX--20/5292--SE}},
year = {2020},
address = {Sweden},
}
The process of locating moving objects through video sequences is a fundamental computer vision problem. This process is referred to as video tracking and has a broad range of applications. Even though video tracking is an open research topic that has received much attention during recent years, developing accurate and robust algorithms that can handle complicated tracking tasks and scenes is still challenging. One challenge in computer vision is to develop systems that, like humans, can understand, interpret and recognize visual information in different situations.
In this master thesis work, a tracking algorithm based on eye tracking data is proposed. The aim was to compare the tracking performance of the proposed algorithm with that of a state-of-the-art video tracker. The algorithm was tested on gaze signals from five participants, recorded with an eye tracker while the participants were exposed to dynamic stimuli. The stimuli were moving objects displayed on a stationary computer screen. The proposed algorithm works offline, meaning that all data is collected before analysis.
The results show that the overall performance of the proposed eye tracking algorithm is comparable to the performance of a state-of-the-art video tracker. The main weaknesses are low accuracy for the proposed eye tracking algorithm and handling of occlusion for the video tracker. We also suggest a method for using eye tracking as a complement to object tracking methods. The results show that the eye tracker can be used in some situations to improve the tracking result of the video tracker. The proposed algorithm can be used to help the video tracker to redetect objects that have been occluded or for some other reason are not detected correctly. However, the video tracker ATOM achieves higher accuracy.
@mastersthesis{diva2:1435385,
author = {Ejnestrand, Ida and Jakobsson, Linn\'{e}a},
title = {{Object Tracking based on Eye Tracking Data:
A comparison with a state-of-the-art video tracker}},
school = {Linköping University},
type = {{LiTH-ISY-EX--20/5294--SE}},
year = {2020},
address = {Sweden},
}
The main result of this thesis is a deep learning model named BearNet, which can be trained to detect an arbitrary number of objects as a set of points. The model is trained using the Weighted Hausdorff distance as loss function. BearNet has been applied and tested on two problems from the industry. These are:
- From an intensity image, detect two pocket points of an EU-pallet which an autonomous forklift could utilize when determining where to insert its forks.
- From a depth image, detect the start, bend and end points of a straw attached to a juice package, in order to help determine if the straw has been attached correctly.
In the development process of BearNet I took inspiration from the designs of U-Net, UNet++ and a high resolution network named HRNet. Further, I used a dataset containing RGB images from a surveillance camera located inside a mall, on which the aim was to detect the head positions of all pedestrians. In an attempt to reproduce a result from another study, I found that the mall dataset suffers from training set contamination when a model is trained, validated, and tested on it with random sampling. Hence, I propose that the mall dataset be evaluated with a sequential data split strategy, to limit the problem.
I found that the BearNet architecture is well suited for both the EU-pallet and straw datasets, and that it can be successfully used on either RGB, intensity or depth images. On the EU-pallet and straw datasets, BearNet consistently produces point estimates within five and six pixels of ground truth, respectively. I also show that the straw dataset only constitutes a small subset of all the challenges that exist in the problem domain related to the attachment of a straw to a juice package, and that one therefore cannot train a robust deep learning model on it. As an example of this, models trained on the straw dataset cannot correctly handle samples in which there is no straw visible.
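For orientation, the plain average Hausdorff distance between a predicted and a ground-truth point set is shown below; the Weighted Hausdorff distance used as BearNet's loss additionally weights the terms with per-pixel probabilities to make the measure differentiable:

    import numpy as np

    def average_hausdorff(pred_pts, gt_pts):
        """Average Hausdorff distance between two point sets (Nx2 and Mx2):
        mean distance from each predicted point to its nearest ground-truth
        point, plus the same in the other direction."""
        d = np.linalg.norm(pred_pts[:, None, :] - gt_pts[None, :, :], axis=2)
        return d.min(axis=1).mean() + d.min(axis=0).mean()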
@mastersthesis{diva2:1442869,
author = {Runow, Björn},
title = {{Deep Learning for Point Detection in Images}},
school = {Linköping University},
type = {{LiTH-ISY-EX--20/5295--SE}},
year = {2020},
address = {Sweden},
}
Automatic Face Recognition (AFR) can be useful in the forensic field when identifying people in surveillance footage. In AFR systems it is common to use deep neural networks, which perform well if the quality of the images maintains a certain level. This is a problem when applying AFR to surveillance data, since the quality of those images can be very poor. In this thesis the CNN FaceNet has been used to evaluate how different quality parameters influence the accuracy of the face recognition. The goal is to be able to draw conclusions about how to improve the recognition by using and avoiding certain parameters based on the conditions. Parameters that have been experimented with are the angle of the face, image quality, occlusion, colour and lighting. This has been achieved by using datasets with different properties or by altering the images. The parameters are meant to simulate different situations that can occur in surveillance footage and that are difficult for the network to handle. Three different models have been evaluated, with different numbers of embeddings and different training data. The results show that the two models trained on the VGGFace2 dataset perform much better than the one trained on CASIA-WebFace. All models' performance drops on images with low quality compared to images with high quality, because the training data consists mostly of high-quality images. In some cases, the recognition results can be improved by applying some alterations to the images. This could be done by using one frontal and one profile image when trying to identify a person, or by occluding parts of the shape of the face if it gets recognized as other persons with similar face shapes. One main improvement would be to extend the training datasets with more low-quality images. To some extent, this could be achieved by different kinds of data augmentation, like artificial occlusion and down-sampled images.
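FaceNet-style verification compares L2-normalized embeddings by distance; a minimal sketch, with an arbitrary placeholder threshold:

    import numpy as np

    def same_identity(emb_a, emb_b, threshold=0.7):
        """Declare two faces the same identity if the Euclidean distance
        between their L2-normalized embeddings falls below a threshold
        (the value here is a placeholder, tuned on validation data)."""
        a = emb_a / np.linalg.norm(emb_a)
        b = emb_b / np.linalg.norm(emb_b)
        return np.linalg.norm(a - b) < threshold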
@mastersthesis{diva2:1444005,
author = {Tuvskog, Johanna},
title = {{Evaluation of Face Recognition Accuracy in Surveillance Video}},
school = {Linköping University},
type = {{LiTH-ISY-EX--20/5302--SE}},
year = {2020},
address = {Sweden},
}
The field of autonomous driving is as active as it has ever been, but the reality where an autonomous vehicle can drive on all roads is currently decades away. Instead, using an on-the-fly learning method, such as qHebb learning, a system can, after some demonstration, learn the appearance of any road and take over the steering wheel. By training in a simulator, the amount and variation of training can increase substantially; however, an on-rails auto-pilot does not sufficiently populate the learning space of such a model. This study aims to explore concepts that can increase the variance in the training data whilst the vehicle trains online. Three computationally light concepts are proposed, each of which results in a model that can navigate through a simple environment, thus performing better than a model trained solely on the auto-pilot. The most noteworthy approach uses multiple thresholds to detect when the vehicle deviates too much and replicates the action of a human correcting its trajectory. After training on less than 300 frames, a vehicle successfully completed the full test environment using this method.
@mastersthesis{diva2:1444702,
author = {Kindstedt, Mathias},
title = {{Exploring the Training Data for Online Learning of Autonomous Driving in a Simulated Environment}},
school = {Linköping University},
type = {{LiTH-ISY-EX--20/5325--SE}},
year = {2020},
address = {Sweden},
}
This thesis investigates the use of Generative Adversarial Networks (GANs) for detecting images containing non-natural objects in natural environments, and whether the introduction of stereo data can improve the performance. The state-of-the-art GAN-based anomaly detection method presented by A. Berg et al. in [5] (BergGAN) was the base of this thesis. By modifying BergGAN to accept not only three-channel input, but also four- and six-channel input, it was possible to investigate the effect of introducing stereo data in the method. The input to the four-channel network was an RGB image and its corresponding disparity map, and the input to the six-channel network was a stereo pair consisting of two RGB images. The three datasets used in the thesis were constructed from a dataset of aerial video sequences provided by SAAB Dynamics, where the scene was mostly wooded areas. The datasets were divided into training and validation data, where the latter was used for the performance evaluation of the respective network. The evaluation method suggested in [5] was used in the thesis: each sample was scored on the likelihood of it containing anomalies, Receiver Operating Characteristic (ROC) analysis was then applied, and the area under the ROC curve was calculated. The results showed that BergGAN was successfully able to detect images containing non-natural objects in natural environments using the dataset provided by SAAB Dynamics. The adaptation of BergGAN to also accept four and six input channels increased the performance of the method, showing that there is information in stereo data that is relevant for GAN-based anomaly detection. There was, however, no substantial performance difference between the network trained with two RGB images and the one trained with an RGB image and its corresponding disparity map.
@mastersthesis{diva2:1442532,
author = {Gehlin, Nils and Antonsson, Martin},
title = {{Detecting Non-Natural Objects in a Natural Environment using Generative Adversarial Networks with Stereo Data}},
school = {Linköping University},
type = {{LiTH-ISY-EX--20/5324--SE}},
year = {2020},
address = {Sweden},
}
In this thesis capsule networks are investigated, both theoretically and empirically. The properties of the dynamic routing algorithm [42] proposed for capsule networks, as well as of a routing algorithm from a follow-up paper by Wang et al. [50], are thoroughly investigated. It is conjectured that three key attributes are needed for a good routing algorithm, and these attributes are then related to previous algorithms. A novel routing algorithm, EntMin, is proposed based on the observations from the investigation of previous algorithms. A thorough evaluation of the performance of different aspects of capsule networks is conducted, and it is shown that EntMin outperforms both dynamic routing and Wang routing. Finally, a capsule network using EntMin routing is compared to a very deep Convolutional Neural Network, and it is shown that it achieves comparable performance.
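For orientation, here is a compact sketch of the dynamic routing ("routing by agreement") scheme from [42] that EntMin and Wang routing are alternatives to; the array sizes are arbitrary toy values.

```python
# Sketch of dynamic routing between capsule layers: coupling coefficients are
# softmax-updated from the agreement between predictions and the squashed
# output capsules. Not the thesis' EntMin algorithm, just the [42] baseline.
import numpy as np

def squash(v, axis=-1, eps=1e-9):
    norm2 = np.sum(v ** 2, axis=axis, keepdims=True)
    return (norm2 / (1 + norm2)) * v / np.sqrt(norm2 + eps)

def dynamic_routing(u_hat: np.ndarray, n_iters: int = 3) -> np.ndarray:
    """u_hat: (n_in, n_out, dim) predictions from lower-level capsules."""
    n_in, n_out, _ = u_hat.shape
    b = np.zeros((n_in, n_out))
    for _ in range(n_iters):
        e = np.exp(b - b.max(axis=1, keepdims=True))
        c = e / e.sum(axis=1, keepdims=True)       # softmax over output capsules
        s = (c[..., None] * u_hat).sum(axis=0)     # weighted sum of predictions
        v = squash(s)                              # output capsules, (n_out, dim)
        b = b + (u_hat * v[None]).sum(-1)          # agreement update
    return v

u_hat = np.random.default_rng(8).normal(size=(32, 10, 16))
print(dynamic_routing(u_hat).shape)  # (10, 16)
```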
@mastersthesis{diva2:1445181,
author = {Edstedt, Johan},
title = {{Towards Understanding Capsule Networks}},
school = {Linköping University},
type = {{LiTH-ISY-EX--20/5309--SE}},
year = {2020},
address = {Sweden},
}
Object detection is a classical computer vision task, encountered in many practical applications such as robotics and autonomous driving. The latter involves serious consequences of failure and a multitude of challenging demands, including high computational efficiency and detection accuracy. Distant objects are notably difficult to detect accurately due to their small scale in the image, consisting of only a few pixels. This is especially problematic in autonomous driving, as objects should be detected at the earliest possible stage to facilitate handling of hazardous situations. Previous work has addressed small objects via use of feature pyramids and super-resolution techniques, but the efficiency of such methods is limited as computational cost increases with image resolution. Therefore, a trade-off must be made between accuracy and cost. Opportunely though, a common characteristic of driving scenarios is the predominance of distant objects in the centre of the image. Thus, the full-frame image can be downsampled to reduce computational cost, and a crop can be extracted from the image centre to preserve resolution for distant vehicles. In this way, short- and long-range images are generated. This thesis investigates the fusion of such images in a convolutional neural network, particularly the fusion level, fusion operation, and spatial alignment. A novel framework — DetSLR — is proposed for the task and examined via the aforementioned aspects. Through adoption of the framework for the well-established SSD detector and MobileNetV2 feature extractor, it is shown that the framework significantly improves upon the original detector without incurring additional cost. The fusion level is shown to have great impact on the performance of the framework, favouring high-level fusion, while only insignificant differences exist between investigated fusion operations. Finally, spatial alignment of features is demonstrated to be a crucial component of the framework.
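A minimal sketch of the short-/long-range image generation described above, assuming example frame and output sizes; the thesis' actual resolutions and resampling filters may differ.

```python
# Sketch: the full frame is downsampled for short-range context, while a
# full-resolution centre crop preserves detail for distant vehicles.
import numpy as np

def make_short_long_pair(frame: np.ndarray, out_hw=(300, 300)):
    h, w = frame.shape[:2]
    ch, cw = out_hw
    # Long-range image: full-resolution centre crop.
    y0, x0 = (h - ch) // 2, (w - cw) // 2
    long_range = frame[y0:y0 + ch, x0:x0 + cw]
    # Short-range image: naive strided downsampling of the whole frame
    # (a proper implementation would use an anti-aliased resize).
    short_range = frame[::h // ch, ::w // cw][:ch, :cw]
    return short_range, long_range

frame = np.zeros((1200, 1920, 3), dtype=np.uint8)
short, long_ = make_short_long_pair(frame)
print(short.shape, long_.shape)  # (300, 300, 3) (300, 300, 3)
```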
@mastersthesis{diva2:1447580,
author = {Luusua, Emil},
title = {{Vehicle Detection, at a Distance:
Done Efficiently via Fusion of Short- and Long-Range Images}},
school = {Linköping University},
type = {{LiTH-ISY-EX--20/5328--SE}},
year = {2020},
address = {Sweden},
}
Recent improvements in pose estimation have opened up the possibility of new areas of application. One of them is gait recognition, the task of identifying persons based on their unique style of walking, which is increasingly being recognized as an important method of biometric identification. This thesis has explored the possibilities of using a pose estimation system, OpenPose, together with deep Recurrent Neural Networks (RNNs) to see if there is sufficient information in sequences of 2D poses to use for gait recognition. For this to be possible, a new multi-camera dataset consisting of persons walking on a treadmill was gathered, dubbed the FOI dataset. The results show that this approach has some promise. It achieved an overall classification accuracy of 95.5% on classes it had seen during training and 83.8% for classes it had not seen during training. It was, however, unable to recognize sequences from angles it had not seen during training. For that to be possible, more data pre-processing will likely be required.
@mastersthesis{diva2:1447593,
author = {Persson, Martin},
title = {{Automatic Gait Recognition:
using deep metric learning}},
school = {Linköping University},
type = {{LIU-ISY/LITH-EX-A--20/5316--SE}},
year = {2020},
address = {Sweden},
}
In digital image correlation, an optical full-field analysis method that can determine displacements of an object under load, high-resolution images are preferable. One way to improve the resolution is to improve the camera hardware. This can be expensive; another way to enhance the image is to increase its resolution with various image processing techniques, collectively called super-resolution. In this thesis the theory behind several different approaches to super-resolution is presented and discussed. The goal of this thesis has been to investigate if super-resolution is possible in a scene with moving objects as well as movement of the camera. It became clear early on that image registration, a step in many super-resolution methods that is explained in this thesis, was of utmost importance, and a major part of the work became comparing image registration methods. Data has been recorded, and two different super-resolution algorithms have then been evaluated on a data set, showing that super-resolution is possible.
@mastersthesis{diva2:1450740,
author = {Dahlström, Erik},
title = {{Super-Resolution Using Dynamic Cameras}},
school = {Linköping University},
type = {{LiTH-ISY-EX--20/5315--SE}},
year = {2020},
address = {Sweden},
}
Autonomous cars are now becoming a reality, but there are still technical hurdles to overcome for the technology to be safe and reliable. One of these issues is the cars' ability to estimate braking distances. This function relies heavily on one parameter, friction. Friction is difficult for a car to estimate, since the friction coefficient depends on both surfaces in contact: the tires and the road. This thesis presents a novel approach to the problem using a neural network classifier trained on features extracted from images of the road. One major advantage the presented method has over the few existing conventional methods is the ability to estimate friction on road segments ahead of the vehicle. This gives the vehicle time to slow down while the friction is still sufficient. The estimation pipeline performs significantly better than the baseline methods explored in the thesis and provides satisfying results that demonstrate its potential.
@mastersthesis{diva2:1454043,
author = {Svensson, Erik},
title = {{Transfer Learning for Friction Estimation:
Using Deep Reduced Features}},
school = {Linköping University},
type = {{LiTH-ISY-EX--20/5312--SE}},
year = {2020},
address = {Sweden},
}
Light Detection and Ranging (LiDAR) sensors have many different application areas, from revealing archaeological structures to aiding navigation of vehicles. However, it is challenging to interpret and fully use the vast amount of unstructured data that LiDARs collect. Automatic classification of LiDAR data would ease the utilization, whether it is for examining structures or aiding vehicles.
In recent years, there have been many advances in deep learning for semantic segmentation of automotive LiDAR data, but there is less research on aerial LiDAR data. This thesis investigates the current state-of-the-art deep learning architectures, and how well they perform on LiDAR data acquired by an Unmanned Aerial Vehicle (UAV). It also investigates different training techniques for class imbalanced and limited datasets, which are common challenges for semantic segmentation networks. Lastly, this thesis investigates if pre-training can improve the performance of the models.
The LiDAR scans were first projected to range images and then a fully convolutional semantic segmentation network was used. Three different training techniques were evaluated: weighted sampling, data augmentation, and grouping of classes. No improvement was observed from the weighted sampling, nor did grouping of classes have a substantial effect on the performance. Pre-training on the large public dataset SemanticKITTI resulted in a small performance improvement, but the data augmentation seemed to have the largest positive impact. The mIoU of the best model, which was trained with data augmentation, was 63.7%, and it performed very well on the classes Ground, Vegetation, and Vehicle. The other classes in the UAV dataset, Person and Structure, had very little data and were challenging for most models to classify correctly. In general, the models trained on UAV data performed similarly to the state-of-the-art models trained on automotive data.
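To make the projection step concrete, below is a hedged sketch of mapping LiDAR points to a range image via their azimuth and elevation angles; the image size and vertical field of view are assumed values, not the thesis' settings.

```python
# Sketch of spherical (range-image) projection of a LiDAR point cloud, so a
# 2D segmentation network can be applied to the result.
import numpy as np

def project_to_range_image(points: np.ndarray, H=64, W=1024,
                           fov_up=np.radians(15.0), fov_down=np.radians(-25.0)):
    x, y, z = points[:, 0], points[:, 1], points[:, 2]
    r = np.maximum(np.linalg.norm(points[:, :3], axis=1), 1e-6)
    yaw = np.arctan2(y, x)                    # azimuth in [-pi, pi]
    pitch = np.arcsin(np.clip(z / r, -1, 1))  # elevation
    u = ((0.5 * (1.0 - yaw / np.pi)) * W).astype(int).clip(0, W - 1)
    v = ((1.0 - (pitch - fov_down) / (fov_up - fov_down)) * H).astype(int).clip(0, H - 1)
    img = np.zeros((H, W), dtype=np.float32)
    img[v, u] = r                             # keeps the last point per pixel
    return img

pts = np.random.default_rng(1).uniform(-50, 50, size=(10000, 3))
print(project_to_range_image(pts).shape)  # (64, 1024)
```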
@mastersthesis{diva2:1459609,
author = {Serra, Sabina},
title = {{Deep Learning for Semantic Segmentation of 3D Point Clouds from an Airborne LiDAR}},
school = {Linköping University},
type = {{LiTH-ISY-EX--20/5331--SE}},
year = {2020},
address = {Sweden},
}
The task of 6D pose estimation with deep learning is to train networks to determine, from an image of an object, the rotation and translation of the object. Impressive results have recently been shown in deep learning based 6D pose estimation. However, many current solutions rely on real-world data for training, which, as opposed to synthetic data, requires time-consuming annotation. In this thesis, we introduce a pipeline for generating synthetic ground truth data for deep 6D pose estimation, where annotation is done automatically. With a 3D CAD model, we use Blender to render 2D images of the model from different viewpoints. We also create all other relevant data needed for pose estimation, e.g., the poses of an object, mask images and 3D keypoints on the object. Using this pipeline, it is possible to adjust different settings to reduce the domain gap between synthetic and real-world data and get better pose estimation results. Such settings include changing the method of extracting 3D keypoints and varying the scale of the object or the light settings in the scene. The network used to test the performance of training on our synthetic data is PVNet, which achieves state-of-the-art results for 6D pose estimation. This architecture learns to find 2D keypoints of the object in the image, as well as 2D–3D keypoint correspondences. With these correspondences, the Perspective-n-Point (PnP) algorithm is used to extract a pose. We evaluate the pose estimation for the different settings on the synthetic data and compare these results to other state-of-the-art work. We find that using only real-world data for training is worse than using a combination of synthetic and real-world data. Several other findings are that varying scale and lighting, in addition to adding random background images to the rendered images, improves results. Four novel keypoint selection methods are introduced in this work and compared against methods used in previous work. We observe that our methods achieve similar or better results. Finally, we use the best possible settings from the synthetic data pipeline, but with memory limitations on the amount of training data. We come close to state-of-the-art results, and could get closer with more data.
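The final pose-recovery step named above can be sketched with OpenCV's PnP solver; the 3D keypoints, 2D detections, and camera intrinsics below are made-up placeholders rather than PVNet outputs.

```python
# Sketch: given 2D-3D keypoint correspondences, PnP returns the object pose
# as a rotation (Rodrigues vector) and translation.
import numpy as np
import cv2

object_pts = np.array([[0, 0, 0], [1, 0, 0], [0, 1, 0], [0, 0, 1],
                       [1, 1, 0], [1, 0, 1]], dtype=np.float64)   # 3D keypoints
image_pts = np.array([[320, 240], [400, 238], [318, 160], [322, 300],
                      [398, 158], [402, 302]], dtype=np.float64)  # 2D detections
K = np.array([[800.0, 0, 320], [0, 800.0, 240], [0, 0, 1]])      # intrinsics (assumed)

ok, rvec, tvec = cv2.solvePnP(object_pts, image_pts, K, None)
print(ok, rvec.ravel(), tvec.ravel())
```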
@mastersthesis{diva2:1467210,
author = {Löfgren, Tobias and Jonsson, Daniel},
title = {{Generating Synthetic Data for Evaluation and Improvement of Deep 6D Pose Estimation}},
school = {Linköping University},
type = {{LiTH-ISY-EX--20/5339--SE}},
year = {2020},
address = {Sweden},
}
Multi-pedestrian tracking (MPT) is the task of localizing and following the trajectory of pedestrians in a sequence. Using an MPT algorithm is an important part of preventing pedestrian-vehicle collisions in Automated Driving (AD) and Advanced Driving Assistance Systems (ADAS). It has benefited greatly from the advances in computer vision and machine learning in the last decades. Using a pedestrian detector, the tracking consists of associating the detections between frames and maintaining pedestrian identities throughout the sequence. This can be a challenging task due to occlusions, missed detections and complex scenes. The number of pedestrians is unknown, and it varies with time. Finding new methods for improving MPT is an active research field and there are many approaches in the literature. This work focuses on improving the detection-to-track association, the data association, with the help of extracted color features for each pedestrian. Utilizing the recent improvements in object detection, this work shows that classical color features are still relevant in pedestrian tracking for real-time applications with limited computational resources. The appearance is not only used in the data association but is also integrated into a newly proposed method to avoid tracking errors due to missed detections. The results show that even with simple models the color appearance can be used to improve the tracking results. Evaluation on the commonly used Multi-Object Tracking benchmark shows an improvement in Multi-Object Tracking Accuracy and in the number of identity switches, while other measures stay essentially unchanged.
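As a sketch of the kind of classical colour feature such data association can rely on, the snippet below builds per-detection colour histograms and compares them with histogram intersection; the random patches stand in for detected pedestrian crops, and the descriptor is illustrative rather than the thesis' exact feature.

```python
# Sketch: colour histograms as lightweight appearance features for matching
# a new detection against an existing track.
import numpy as np

def color_histogram(patch: np.ndarray, bins: int = 8) -> np.ndarray:
    """Concatenated, normalized per-channel histograms of an RGB patch."""
    hists = [np.histogram(patch[..., c], bins=bins, range=(0, 255))[0]
             for c in range(3)]
    h = np.concatenate(hists).astype(float)
    return h / h.sum()

def similarity(h1: np.ndarray, h2: np.ndarray) -> float:
    """Histogram intersection: 1.0 means identical colour distributions."""
    return float(np.minimum(h1, h2).sum())

rng = np.random.default_rng(2)
detection = rng.integers(0, 256, size=(64, 32, 3))
track = rng.integers(0, 256, size=(64, 32, 3))
print(similarity(color_histogram(detection), color_histogram(track)))
```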
@mastersthesis{diva2:1467160,
author = {Flodin, Frida},
title = {{Improved Data Association for Multi-Pedestrian Tracking Using Image Information}},
school = {Linköping University},
type = {{LiTH-ISY-EX--20/5329--SE}},
year = {2020},
address = {Sweden},
}
Forged videos of swapped faces, so-called deepfakes, have gained a lot of attention in recent years. Methods for automated detection of this type of manipulation are also seeing rapid progress. The purpose of this thesis work is to evaluate the possibility and effectiveness of using deep embeddings from facial recognition networks as a basis for detecting such deepfakes. In addition, the thesis aims to answer whether the identity embeddings contain information that can be used for detection when analyzed over time, and whether it is suitable to include information about the person's head pose in this analysis. To answer these questions, three classifiers are created, each intended to answer one question. Their performances are compared with each other, and it is shown that identity embeddings are suitable as a basis for deepfake detection. Temporal analysis of the embeddings also seems effective, at least for deepfake methods that only work on a frame-by-frame basis. Including information about head poses in the videos is shown not to improve such a classifier.
@mastersthesis{diva2:1476999,
author = {Emir, Alkazhami},
title = {{Facial Identity Embeddings for Deepfake Detection in Videos}},
school = {Linköping University},
type = {{LiTH-ISY-EX--20/5341--SE}},
year = {2020},
address = {Sweden},
}
CNN-based (Convolutional Neural Network) visual object detectors often reach human-level accuracy but need to be trained with large amounts of manually annotated data. Collecting and annotating this data can frequently be time-consuming and financially expensive. Using generative models to augment the data can help minimize the amount of data required and increase detection performance. Many state-of-the-art generative models are Generative Adversarial Networks (GANs). This thesis investigates if and how one can utilize image data to generate new data through GANs to train a YOLO-based (You Only Look Once) object detector, and how CAD (Computer-Aided Design) models can aid in this process.
In the experiments, different models of GANs are trained and evaluated by visual inspection or with the Fréchet Inception Distance (FID) metric. The data provided by Ericsson Research consists of images of antenna and baseband equipment along with annotations and segmentations. Ericsson Research supplied the YOLO detector, and no modifications are made to this detector. Finally, the YOLO detector is trained on data generated by the chosen model and evaluated by the Average Precision (AP).
The results show that the generative models designed in this work can produce RGB images of high quality. However, the quality drops if binary segmentation masks are to be generated as well. The experiments with CAD input data did not result in images that could be used for the training of the detector.
The GAN designed in this work is able to successfully replace objects in images with the style of other objects. The results show that training the YOLO detector with GAN-modified data compared to training with real data leads to the same detection performance. The results also show that the shapes and backgrounds of the antennas contributed more to detection performance than their style and colour.
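For reference, a small sketch of the Fréchet Inception Distance used to score the GANs above; the random feature matrices stand in for Inception activations of real and generated images.

```python
# Sketch of FID: real and fake feature sets are compared as Gaussians via
# their means and covariances.
import numpy as np
from scipy.linalg import sqrtm

def fid(feats_real: np.ndarray, feats_fake: np.ndarray) -> float:
    mu1, mu2 = feats_real.mean(0), feats_fake.mean(0)
    c1 = np.cov(feats_real, rowvar=False)
    c2 = np.cov(feats_fake, rowvar=False)
    covmean = sqrtm(c1 @ c2)
    if np.iscomplexobj(covmean):       # discard tiny imaginary parts
        covmean = covmean.real
    return float(((mu1 - mu2) ** 2).sum() + np.trace(c1 + c2 - 2 * covmean))

rng = np.random.default_rng(9)
real = rng.normal(0.0, 1, size=(500, 64))   # stand-ins for Inception features
fake = rng.normal(0.5, 1, size=(500, 64))
print(fid(real, fake))
```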
@mastersthesis{diva2:1484523,
author = {Thaung, Ludwig},
title = {{Advanced Data Augmentation:
With Generative Adversarial Networks and Computer-Aided Design}},
school = {Linköping University},
type = {{LiTH-ISY-EX--20/5340--SE}},
year = {2020},
address = {Sweden},
}
3D reconstruction can be used in forensic science to reconstruct crime scenes and objects so that measurements and further information can be acquired off-site. It is desirable to use image-based reconstruction methods, but there is currently no procedure available for determining the uncertainty of such reconstructions. In this thesis the uncertainty of Structure from Motion is investigated. This is done by exploring the literature available on the subject and compiling the relevant information in a literature summary. Also, Monte Carlo simulations are conducted to study how the feature position uncertainty affects the uncertainty of the parameters estimated by bundle adjustment.
The experimental results show that the poses of cameras that contain few image correspondences are estimated with higher uncertainty. The poses of such cameras are estimated with lower uncertainty if they have feature correspondences in cameras that contain a higher number of projections.
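The Monte Carlo scheme can be sketched generically as below: perturb the feature positions with Gaussian noise, rerun the estimator, and read the uncertainty off the spread of the estimates. The line-fit estimator merely stands in for the bundle adjustment used in the thesis, and the noise level is an assumption.

```python
# Generic Monte Carlo uncertainty propagation sketch.
import numpy as np

def monte_carlo_uncertainty(features, estimate, sigma_px=0.5, n_runs=500, seed=0):
    rng = np.random.default_rng(seed)
    samples = np.asarray([estimate(features + rng.normal(0, sigma_px, features.shape))
                          for _ in range(n_runs)])
    return samples.mean(axis=0), np.cov(samples.T)

# Toy estimator: a least-squares line fit standing in for bundle adjustment.
feats = np.column_stack([np.linspace(0, 10, 20), 2 * np.linspace(0, 10, 20) + 1])
fit = lambda f: np.polyfit(f[:, 0], f[:, 1], 1)   # returns (slope, intercept)
mean, cov = monte_carlo_uncertainty(feats, fit)
print(mean, np.sqrt(np.diag(cov)))   # parameter means and standard deviations
```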
@mastersthesis{diva2:1499090,
author = {Lindberg, Mimmi},
title = {{Forensic Validation of 3D models}},
school = {Linköping University},
type = {{LiTH-ISY-EX--20/5346--SE}},
year = {2020},
address = {Sweden},
}
In one of the facilities at the Stena Recycling plant in Halmstad, Sweden, about 300 tonnes of metallic waste is processed each day with the aim of sorting out all non-ferrous material. At the end of this process, non-ferrous materials are manually sorted out from the ferrous materials. This thesis investigates a computer vision based approach to identify and localize the non-ferrous materials and eventually automate the sorting.
Images were captured of ferrous and non-ferrous materials. The images are processed and segmented to be used as annotation data for a deep convolutional neural segmentation network. Network models have been trained on different kinds and amounts of data. The resulting models are evaluated and tested in accordance with different evaluation metrics. Methods of creating advanced training data by merging imaging information were tested. Experiments with using classifier prediction confidence to identify objects of unknown classes were performed.
This thesis shows that it is possible to discern ferrous from non-ferrous material with a purely vision based system. The thesis also shows that it is possible to automatically create annotated training data. It becomes evident that it is possible to create better training data, tailored for the task at hand, by merging image data. A segmentation network trained on more than two classes yields lower prediction confidence for objects unknown to the classifier. Substituting manual sorting with a purely vision based system seems like a viable approach. Before a substitution is considered, the automatic system needs to be evaluated in comparison to the manual sorting.
@mastersthesis{diva2:1552630,
author = {Almin, Fredrik},
title = {{Detection of Non-Ferrous Materials with Computer Vision}},
school = {Linköping University},
type = {{LiTH-ISY-EX--20/5321--SE}},
year = {2020},
address = {Sweden},
}
Visual tracking concerns the problem of following an arbitrary object in a video sequence. In this thesis, we examine how to use stereo images to extend existing visual tracking algorithms, which methods exist to obtain information from stereo images, and how the results change as the parameters of each tracker vary. For this purpose, four abstract approaches are identified, with five distinct implementations. Each tracker implementation is an extension of a baseline algorithm, MOSSE. The free parameters of each model are optimized with respect to two different evaluation strategies, called nor- and wir-tests, and four different objective functions, and are then fixed when comparing the models against each other. The results are created on single-target tracks extracted from the KITTI tracking dataset, and the optimization results show that none of the objective functions are sensitive to the exposed parameters under the joint selection of model and dataset. The evaluation results also show that none of the extensions improve the results of the baseline tracker.
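For context, a condensed sketch of the MOSSE baseline that each extension builds on: the correlation filter is solved in the Fourier domain so that the target patch maps to a Gaussian response peak. Patch size, the Gaussian width, and the regularizer are assumed toy values; the full tracker also uses preprocessing and online updates, omitted here.

```python
# Minimal single-frame MOSSE sketch.
import numpy as np

def train_mosse(patch: np.ndarray, sigma=2.0, lam=1e-3) -> np.ndarray:
    h, w = patch.shape
    yy, xx = np.mgrid[0:h, 0:w]
    g = np.exp(-((yy - h // 2) ** 2 + (xx - w // 2) ** 2) / (2 * sigma ** 2))
    F, G = np.fft.fft2(patch), np.fft.fft2(g)
    return (G * np.conj(F)) / (F * np.conj(F) + lam)   # filter in Fourier domain

def respond(filt: np.ndarray, patch: np.ndarray) -> np.ndarray:
    return np.real(np.fft.ifft2(filt * np.fft.fft2(patch)))

rng = np.random.default_rng(3)
target = rng.normal(size=(64, 64))
response = respond(train_mosse(target), target)
print(np.unravel_index(response.argmax(), response.shape))  # peak near (32, 32)
```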
@mastersthesis{diva2:1277154,
author = {Dehlin, Carl},
title = {{Visual Tracking Using Stereo Images}},
school = {Linköping University},
type = {{LiTH-ISY-EX--18/5181--SE}},
year = {2019},
address = {Sweden},
}
This thesis presents and evaluates different methods to semantically segment 3D-models by rendered 2D-views. The 2D-views are segmented separately and then merged together. The thesis evaluates three different merge strategies, two different classification architectures, how many views should be rendered and how these rendered views should be arranged. The results are evaluated both quantitatively and qualitatively and then compared with the current classifier at Vricon presented in [30].
The conclusion of this thesis is that there is a performance gain to be had using this method. The best model used two views and attains an accuracy of 90.89%, which can be compared with the 84.52% achieved by the single-view network from [30]. The best nine-view system achieved 87.72%. The difference in accuracy between the two- and nine-view systems is attributed to the higher quality of the mesh on the sunny side of objects, which typically is the south side.
The thesis provides a proof of concept and there are still many areas where the system can be improved. One of them is the extraction of training data, which seemingly would have a large impact on performance.
@mastersthesis{diva2:1278684,
author = {Tranell, Victor},
title = {{Semantic Segmentation of Oblique Views in a 3D-Environment}},
school = {Linköping University},
type = {{LiTH-ISY-EX--18/5185--SE}},
year = {2019},
address = {Sweden},
}
Visual Simultaneous Localization And Mapping (SLAM) allows for three-dimensional reconstruction from a camera's output and simultaneous positioning of the camera within the reconstruction. With use cases ranging from autonomous vehicles to augmented reality, the SLAM field has garnered interest both commercially and academically.
A SLAM system performs odometry as it estimates the camera's movement through the scene. The incremental estimation of odometry is not error free and exhibits drift over time, with map inconsistencies as a result. Detecting the return to a previously seen place, a loop, means that this new information regarding our position can be incorporated to correct the trajectory retroactively. Loop detection can also facilitate relocalization if the system loses tracking due to e.g. heavy motion blur.
This thesis proposes an odometric system making use of bundle adjustment within a keyframe-based stereo SLAM application. The system is capable of detecting loops by utilizing the algorithm FAB-MAP. Two aspects of this system are evaluated, the odometry and the capability to relocate. Both are evaluated using the EuRoC MAV dataset, with an absolute trajectory RMS error ranging from 0.80 m to 1.70 m for the machine hall sequences.
The capability to relocate is evaluated using a novel methodology that can be interpreted intuitively. Results are given for different levels of strictness to encompass different use cases. The method uses reprojection of points seen in keyframes to define whether a relocalization is possible or not. The system shows a capability to relocate in up to 85% of all cases when a keyframe exists that can project 90% of its points into the current view. Errors in estimated poses were found to be correlated with the relative distance, with errors less than 10 cm in 23% to 73% of all cases.
The evaluation of the whole system is augmented with an evaluation of local image descriptors and pose estimation algorithms. The descriptor SIFT was found to perform best overall, but is demanding to compute. BRISK was deemed the best alternative for a fast yet accurate descriptor.
A conclusion that can be drawn from this thesis is that FAB-MAP works well for detecting loops as long as the addition of keyframes is handled appropriately.
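The relocalization criterion described above can be sketched as follows: project a keyframe's 3D points into the current camera and measure the fraction that lands inside the image. The intrinsics below are roughly EuRoC-like, while the pose and points are illustrative placeholders.

```python
# Sketch of the reprojection-based relocalization check.
import numpy as np

def visible_fraction(points_w, R, t, K, width=752, height=480):
    """points_w: Nx3 world points; R, t: world-to-camera pose; K: intrinsics."""
    cam = points_w @ R.T + t
    in_front = cam[:, 2] > 0
    uv = cam[in_front] @ K.T
    uv = uv[:, :2] / uv[:, 2:3]
    inside = ((uv[:, 0] >= 0) & (uv[:, 0] < width) &
              (uv[:, 1] >= 0) & (uv[:, 1] < height))
    return inside.sum() / len(points_w)

K = np.array([[458.0, 0, 376], [0, 458.0, 240], [0, 0, 1]])
pts = np.random.default_rng(4).uniform([-2, -2, 2], [2, 2, 8], size=(200, 3))
frac = visible_fraction(pts, np.eye(3), np.zeros(3), K)
print(f"relocalization possible: {frac >= 0.9} ({frac:.0%} of points inside)")
```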
@mastersthesis{diva2:1287320,
author = {Ringdahl, Viktor},
title = {{Stereo Camera Pose Estimation to Enable Loop Detection}},
school = {Linköping University},
type = {{LiTH-ISY-EX--19/5186--SE}},
year = {2019},
address = {Sweden},
}
Visual object detection is a popular computer vision task that has been intensively investigated using deep learning on real data. However, data from virtual environments have not received the same attention. A virtual environment enables generating data for locations that are not easily reachable for data collection, e.g. aerial environments. In this thesis, we study the problem of object detection in virtual environments, more specifically an aerial virtual environment. We use a simulator to generate a synthetic data set of 16 different types of vehicles captured from an airplane.
To study the performance of existing methods in virtual environments, we train and evaluate two state-of-the-art detectors on the generated data set. Experiments show that both detectors, You Only Look Once version 3 (YOLOv3) and Single Shot MultiBox Detector (SSD), reach similar performance quality as previously presented in the literature on real data sets.
In addition, we investigate different fusion techniques between detectors trained on two different subsets of the data set, in this case one subset in which the cars have fixed colors and one in which the colors vary. Experiments show that it is possible to train multiple instances of the detector on different subsets of the data set, and to combine these detectors in order to boost the performance.
@mastersthesis{diva2:1307568,
author = {Norrstig, Andreas},
title = {{Visual Object Detection using Convolutional Neural Networks in a Virtual Environment}},
school = {Linköping University},
type = {{LiTH-ISY-EX--19/5195--SE}},
year = {2019},
address = {Sweden},
}
This report is the result of a master thesis made by two students at Linköping University. The aim was to find an image registration method for visual and infrared images and to find an error measure for grading the registration performance. In practice this could be used for position determination by registering the infrared image taken at the current position to a set of visual images with known positions and determining which visual image matches the best. Two methods were tried, using different image feature extractors and different ways to match the features. The first method used phase information in the images to generate soft features and then minimised the square error of the optical flow equation to estimate the transformation between the visual and infrared image. The second method used the Canny edge detector to extract hard features from the images and Chamfer distance as an error measure. Both methods were evaluated for registration as well as position determination and yielded promising results. However, the performance of both methods was image dependent. The soft edge method proved to be more robust and precise and worked better than the hard edge method for both registration and position determination.
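The Chamfer error measure used by the hard-edge method can be sketched with a distance transform: each pixel of the transformed reference map stores the distance to the nearest edge, so scoring a candidate edge set reduces to a lookup. The synthetic edge maps below stand in for Canny output.

```python
# Sketch of Chamfer matching between two binary edge maps.
import numpy as np
from scipy.ndimage import distance_transform_edt

def chamfer_score(ref_edges: np.ndarray, query_edges: np.ndarray) -> float:
    """Mean distance from each query edge pixel to the nearest reference edge."""
    dist_to_ref = distance_transform_edt(~ref_edges)
    return float(dist_to_ref[query_edges].mean())

ref = np.zeros((100, 100), dtype=bool); ref[50, 10:90] = True
qry = np.zeros((100, 100), dtype=bool); qry[53, 10:90] = True
print(chamfer_score(ref, qry))  # 3.0: the query edge lies 3 pixels away
```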
@mastersthesis{diva2:1323680,
author = {Fridman, Linnea and Nordberg, Victoria},
title = {{Two Multimodal Image Registration Approaches for Positioning Purposes}},
school = {Linköping University},
type = {{LiTH-ISY-EX--19/5208--SE}},
year = {2019},
address = {Sweden},
}
Traffic sign recognition is an important problem for autonomous cars and driver assistance systems. With recent developments in the field of machine learning, high performance can be achieved, but typically at a large computational cost.
This thesis aims to investigate the relation between classification accuracy and computational complexity for the visual recognition problem of classifying traffic signs. In particular, the benefits of partitioning the classification problem into smaller sub-problems using prior knowledge in the form of shape or current region are investigated.
In the experiments, the convolutional neural network (CNN) architecture MobileNetV2 is used, as it is specifically designed to be computationally efficient. To incorporate prior knowledge, separate CNNs are used for the different subsets generated when partitioning the dataset based on region or shape. The separate CNNs are trained from scratch or initialized by pre-training on the full dataset.
The results support the intuitive idea that performance initially increases with network size and indicate a network size where the improvement stops. Including shape information using the two investigated methods does not result in a significant improvement. Including region information using pretrained separate classifiers results in a small improvement for small complexities, for one of the regions in the experiments.
In the end, none of the investigated methods of including prior knowledge are considered to yield an improvement large enough to justify the added implementation complexity. However, some other methods are suggested, which would be interesting to study in future work.
@mastersthesis{diva2:1324051,
author = {Ekman, Carl},
title = {{Traffic Sign Classification Using Computationally Efficient Convolutional Neural Networks}},
school = {Linköping University},
type = {{LiTH-ISY-EX--19/5216--SE}},
year = {2019},
address = {Sweden},
}
In large scale productions of metal sheets, it is important to maintain an effective way to continuously inspect the products passing through the production line. The inspection mainly consists of detection of defects and tracking of ID numbers. This thesis investigates the possibilities to create an automatic inspection system by evaluating different machine learning algorithms for defect detection and optical character recognition (OCR) on metal sheet data. Digit recognition and defect detection are solved separately, where the former compares the object detection algorithm Faster R-CNN and the classical machine learning algorithm NCGF, and the latter is based on unsupervised learning using a convolutional autoencoder (CAE).
The advantage of the feature extraction method is that it only needs a couple of samples to be able to classify new digits, which is desirable in this case due to the lack of training data. Faster R-CNN, on the other hand, needs much more training data to solve the same problem. NCGF does however fail to classify noisy images and images of metal sheets containing an alloy, while Faster R-CNN seems to be a more promising solution with a final mean average precision of 98.59%.
The CAE approach for defect detection showed promising results. The algorithm learned to reconstruct only images without defects, resulting in reconstruction errors whenever a defect appears. The errors are initially classified using a basic thresholding approach, resulting in 98.9% accuracy. However, this classifier requires supervised learning, which is why the clustering algorithm Gaussian mixture model (GMM) is investigated as well. The result shows that it should be possible to use a GMM, but that it requires a lot of GPU resources to use it in an end-to-end solution with a CAE.
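The thresholding rule for the CAE can be sketched as below; the local-averaging "autoencoder" and the threshold are toy stand-ins for the trained network and a validation-tuned value.

```python
# Sketch: flag a defect when the reconstruction error exceeds a threshold.
import numpy as np
from scipy.ndimage import uniform_filter

def is_defective(image: np.ndarray, autoencoder, threshold: float) -> bool:
    reconstruction = autoencoder(image)
    error = np.mean((image - reconstruction) ** 2)   # per-image MSE
    return error > threshold

# Toy stand-in: an "autoencoder" that reconstructs by local averaging, which
# smooths away (and thus reveals) small high-contrast defects.
blur_ae = lambda img: uniform_filter(img, size=5)
clean = np.ones((64, 64))
defect = clean.copy(); defect[30:34, 30:34] = 5.0
print(is_defective(clean, blur_ae, 0.01), is_defective(defect, blur_ae, 0.01))
```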
@mastersthesis{diva2:1325083,
author = {Grönlund, Jakob and Johansson, Angelina},
title = {{Defect Detection and OCR on Steel}},
school = {Linköping University},
type = {{LiTH-ISY-EX--19/5220--SE}},
year = {2019},
address = {Sweden},
}
The interest in autonomous driving assistance, and in the end, self-driving cars, has increased vastly over the last decade. Automotive safety continues to be a priority for manufacturers, politicians and people alike. Visual-based systems aiding the drivers have lately been boosted by advances in computer vision and machine learning. In this thesis, we evaluate the concept of an end-to-end machine learning solution for detecting and classifying road lane markings, and compare it to a more classical semantic segmentation solution. The analysis is based on the frame-by-frame scenario, and shows that our proposed end-to-end system has clear advantages when it comes to detecting the existence of lanes and producing a consistent, lane-like output, especially in adverse conditions such as weak lane markings. Our proposed method allows the system to predict its own confidence, thereby allowing the system to suppress its own output when it is not deemed safe enough. The thesis finishes with proposed future work needed to achieve optimal performance and create a system ready for deployment in an active safety product.
@mastersthesis{diva2:1326388,
author = {Vigren, Malcolm and Eriksson, Linus},
title = {{End-to-End Road Lane Detection and Estimation using Deep Learning}},
school = {Linköping University},
type = {{LiTH-ISY-EX--19/5219--SE}},
year = {2019},
address = {Sweden},
}
Multiple object tracking is the process of assigning unique and consistent identities to objects throughout a video sequence. A popular approach to multiple object tracking, and object tracking in general, is to use a method called tracking-by-detection. Tracking-by-detection is a two-stage procedure: an object detection algorithm first detects objects in a frame, and these objects are then associated with already tracked objects by a tracking algorithm. One of the main concerns of this thesis is to investigate how different object detection algorithms perform on surveillance video supplied by the National Forensic Centre. The thesis then goes on to explore how the stand-alone performance of the object detection algorithm correlates with the overall performance of a tracking-by-detection system. Finally, the thesis investigates how the use of visual descriptors in the tracking stage of a tracking-by-detection system affects performance.
Results presented in this thesis suggest that the capacity of the object detection algorithm is highly indicative of the overall performance of the tracking-by-detection system. Further, this thesis also shows how the use of visual descriptors in the tracking stage can reduce the number of identity switches and thereby increase performance of the whole system.
@mastersthesis{diva2:1326842,
author = {Nyström, Axel},
title = {{Evaluation of Multiple Object Tracking in Surveillance Video}},
school = {Linköping University},
type = {{LiTH-ISY-EX--19/5245--SE}},
year = {2019},
address = {Sweden},
}
Semantic segmentation is a key approach to comprehensive image data analysis. It can be applied to analyze 2D images, videos, and even point clouds that contain 3D data points. On the first two problems, CNNs have achieved remarkable progress, but on point cloud segmentation the results are less satisfactory, due to challenges such as limited memory resources and difficulties in 3D point annotation. One of the research studies carried out by the Computer Vision Lab at Linköping University aimed to ease the semantic segmentation of 3D point clouds. The idea is that by first projecting the 3D data points to 2D space and then focusing only on the analysis of 2D images, we can reduce the overall workload of the segmentation process as well as exploit the existing well-developed 2D semantic segmentation techniques. In order to improve the performance of CNNs for 2D semantic segmentation, the study used input data derived from different modalities. However, how different modalities can be optimally fused is still an open question. Based on the above-mentioned study, this thesis aims to improve the multistream framework architecture. More concretely, we investigate how different singlestream architectures impact the multistream framework with a given fusion method, and how different fusion methods contribute to the overall performance of a given multistream framework. As a result, our proposed fusion architecture outperformed all the investigated traditional fusion methods. Along with the best singlestream candidate and a few additional training techniques, our final proposed multistream framework obtained a relative gain of 7.3% mIoU compared to the baseline on the Semantic3D point cloud test set, increasing the ranking from 12th to 5th position on the benchmark leaderboard.
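For reference, a small sketch of the mIoU metric quoted above, computed from a confusion matrix over toy labels.

```python
# Sketch of mean intersection-over-union from flat prediction/label arrays.
import numpy as np

def mean_iou(pred: np.ndarray, gt: np.ndarray, n_classes: int) -> float:
    cm = np.bincount(n_classes * gt.ravel() + pred.ravel(),
                     minlength=n_classes ** 2).reshape(n_classes, n_classes)
    inter = np.diag(cm)
    union = cm.sum(0) + cm.sum(1) - inter
    return float(np.nanmean(inter / np.where(union == 0, np.nan, union)))

gt = np.array([0, 0, 1, 1, 2, 2])
pred = np.array([0, 1, 1, 1, 2, 0])
print(mean_iou(pred, gt, 3))  # (1/3 + 2/3 + 1/2) / 3 = 0.5
```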
@mastersthesis{diva2:1327473,
author = {He, Linbo},
title = {{Improving 3D Point Cloud Segmentation Using Multimodal Fusion of Projected 2D Imagery Data:
Improving 3D Point Cloud Segmentation Using Multimodal Fusion of Projected 2D Imagery Data}},
school = {Linköping University},
type = {{LiTH-ISY-EX--19/5190--SE}},
year = {2019},
address = {Sweden},
}
In recent years semantic segmentation models utilizing Convolutional Neural Networks (CNN) have seen significant success for multiple different segmentation problems. Models such as U-Net have produced promising results within the medical field for both regular 2D and volumetric imaging, rivalling some of the best classical segmentation methods.
In this thesis we examined the possibility of using a convolutional neural network-based model to perform segmentation of discrete bone fragments in CT-volumes with segmentation-hints provided by a user. We additionally examined different classical segmentation methods used in a post-processing refinement stage and their effect on the segmentation quality. We compared the performance of our model to similar approaches and provided insight into how the interactive aspect of the model affected the quality of the result.
We found that the combined approach of interactive segmentation and deep learning produced results on par with some of the best methods presented, provided there was an adequate amount of annotated training data. We additionally found that the number of segmentation hints provided to the model by the user significantly affected the quality of the result, with the result converging at around 8 provided hints.
@mastersthesis{diva2:1326942,
author = {Estgren, Martin},
title = {{Bone Fragment Segmentation Using Deep Interactive Object Selection}},
school = {Linköping University},
type = {{LiTH-ISY-EX--19/5197--SE}},
year = {2019},
address = {Sweden},
}
One fundamental task in robotics is random bin-picking, where it is important to be able to detect an object in a bin and estimate its pose to plan the motion of a robotic arm. For this purpose, this thesis work aimed to investigate and evaluate algorithms for 6D pose estimation when the object was given by a CAD model. The scene was given by a point cloud illustrating a partial 3D view of the bin with multiple instances of the object. Two algorithms were thus implemented and evaluated. The first algorithm was an approach based on Point Pair Features, and the second was Fast Global Registration. For evaluation, four different CAD models were used to create synthetic data with ground truth annotations.
It was concluded that the Point Pair Feature approach provided a robust localization of objects and can be used for bin-picking. The algorithm appears to be able to handle different types of objects, although with some limitations when the object has flat surfaces and weak texture or many similar details. The disadvantage of the algorithm was the execution time. Fast Global Registration, on the other hand, did not provide a robust localization of objects and is thus not a good solution for bin-picking.
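To make the first algorithm's core descriptor concrete, here is a hedged sketch of the four-dimensional Point Pair Feature computed for two oriented points; in the full method these features are quantized and hashed for matching against the CAD model.

```python
# Sketch of the Point Pair Feature: distance between the points plus the
# three angles between the normals and the connecting line.
import numpy as np

def angle(a: np.ndarray, b: np.ndarray) -> float:
    cos = np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))
    return float(np.arccos(np.clip(cos, -1.0, 1.0)))

def point_pair_feature(p1, n1, p2, n2) -> np.ndarray:
    d = p2 - p1
    return np.array([np.linalg.norm(d), angle(n1, d), angle(n2, d), angle(n1, n2)])

p1, n1 = np.array([0.0, 0, 0]), np.array([0.0, 0, 1])
p2, n2 = np.array([0.1, 0, 0]), np.array([0.0, 1, 0])
print(point_pair_feature(p1, n1, p2, n2))  # [0.1, pi/2, pi/2, pi/2]
```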
@mastersthesis{diva2:1330419,
author = {Lef, Annette},
title = {{CAD-Based Pose Estimation - Algorithm Investigation}},
school = {Linköping University},
type = {{LiTH-ISY-EX--19/5239--SE}},
year = {2019},
address = {Sweden},
}
When subtitles are burned into a video, an encoder error can sometimes cause the same subtitle to be burned into several consecutive frames, so that the subtitle appears frozen. This thesis provides a way to detect frozen video subtitles with the help of an implemented text detector and classifier.
Two types of classifiers, naïve classifiers and machine learning classifiers, are tested and compared on a variety of different videos to see how much a machine learning approach can improve the performance. The naïve classifiers are evaluated using ground truth data to gain an understanding of the importance of good text detection. To understand the difficulty of the problem, two different machine learning classifiers are tested, logistic regression and random forests.
The result shows that machine learning improves the performance over using naïve classifiers by improving the specificity from approximately 87.3% to 95.8% and improving the accuracy from 93.3% to 95.5%. Random forests achieve the best overall performance, but the difference compared to when using logistic regression is small enough that more computationally complex machine learning classifiers are not necessary. Using the ground truth shows that the weaker naïve classifiers would be improved by at least 4.2% accuracy, thus a better text detector is warranted. This thesis shows that machine learning is a viable option for detecting frozen video subtitles.
@mastersthesis{diva2:1331490,
author = {Sjölund, Jonathan},
title = {{Detection of Frozen Video Subtitles Using Machine Learning}},
school = {Linköping University},
type = {{LiTH-ISY-EX--19/5206--SE}},
year = {2019},
address = {Sweden},
}
Finding disparity maps between stereo images is a well studied topic within computer vision. While both classical and machine learning approaches exist in the literature, they frequently struggle to correctly solve the disparity in regions with low texture, sharp edges or occlusions. Finding approximate solutions to these problem areas is frequently referred to as disparity refinement, and is usually carried out separately after an initial disparity map has been generated.
In the recent literature, the use of Normalized Convolution in Convolutional Neural Networks has shown remarkable results when applied to the task of stereo depth completion. This thesis investigates how well this approach performs in the case of disparity refinement. Specifically, we investigate how well such a method can improve the initial disparity maps generated by the stereo matching algorithm developed at Saab Dynamics using a rectified stereo rig.
To this end, a dataset of ground truth disparity maps was created using equipment at Saab, namely a setup for structured light and the stereo rig cameras. Because the end goal is a dataset fit for training networks, we investigate an approach that allows for efficient creation of significant quantities of dense ground truth disparities.
The method for generating ground truth disparities produces several disparity maps for every scene by using several stereo pairs. A densified disparity map is generated by merging the disparity maps from the neighbouring stereo pairs. This resulted in a dataset of 26 scenes and 104 dense and accurate disparity maps.
Our evaluation results show that the chosen Normalized Convolution Network based method can be adapted for disparity map refinement, but is dependent on the quality of the input disparity map.
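The normalized-convolution idea underlying the investigated network can be sketched in its classical form: filter both the data and its confidence mask and divide, so that pixels with no disparity (confidence 0) do not corrupt the estimate. The kernel and hole pattern below are arbitrary.

```python
# Sketch of classical normalized convolution for filling disparity holes.
import numpy as np
from scipy.ndimage import convolve

def normalized_convolution(data, confidence, kernel):
    num = convolve(data * confidence, kernel, mode='nearest')
    den = convolve(confidence, kernel, mode='nearest')
    return num / np.maximum(den, 1e-9)

disp = np.random.default_rng(5).uniform(10, 20, size=(8, 8))
conf = (np.random.default_rng(6).random((8, 8)) > 0.3).astype(float)  # 0 = hole
kernel = np.ones((3, 3)) / 9.0
filled = normalized_convolution(disp, conf, kernel)
print(filled.shape)  # holes are filled from confident neighbours
```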
@mastersthesis{diva2:1333176,
author = {Cranston, Daniel and Skarfelt, Filip},
title = {{Normalized Convolution Network and Dataset Generation for Refining Stereo Disparity Maps}},
school = {Linköping University},
type = {{LiTH-ISY-EX--19/5252--SE}},
year = {2019},
address = {Sweden},
}
Watermarking is a technique used to mark the ownership of media such as audio or images by embedding a watermark, e.g. copyright information, into the media. A good watermarking method should perform this embedding without affecting the quality of the media. Recent methods for watermarking images use deep learning to embed and extract the watermark. In this thesis, we investigate watermarking in the audible frequencies of audio using deep learning. More specifically, we try to create a watermarking method for audio that is robust to noise in the carrier and that allows the embedded watermark to be extracted from the audio after being played over-the-air. The proposed method consists of two deep convolutional neural networks trained end-to-end on music with simulated noise. Experiments show that the proposed method successfully creates watermarks robust to simulated noise with moderate quality reductions, but it is not robust to the real-world noise introduced after playing and recording the audio over-the-air.
@mastersthesis{diva2:1340077,
author = {Tegendal, Lukas},
title = {{Watermarking in Audio using Deep Learning}},
school = {Linköping University},
type = {{LiTH-ISY-EX--19/5246--SE}},
year = {2019},
address = {Sweden},
}
Given satellite images with accompanying pixel classifications and elevation data, we propose different solutions to object detection. The first method uses hierarchical clustering for segmentation and then employs different methods of classification. One of these classification methods uses domain knowledge to classify objects, while the other uses Support Vector Machines. Additionally, a combination of three Support Vector Machines was used in a hierarchical structure, which outperformed the regular Support Vector Machine method on most of the evaluation metrics. The second approach is more conventional, with different types of Convolutional Neural Networks: a segmentation network was used as well as a few detection networks, and different fusions between these. The Convolutional Neural Network approach proved to be the better of the two in terms of precision and recall, but the clustering approach was not far behind. This work was done using a relatively small amount of data, which potentially could have impacted the results of the machine learning models in a negative way.
@mastersthesis{diva2:1346426,
author = {Grahn, Fredrik and Nilsson, Kristian},
title = {{Object Detection in Domain Specific Stereo-Analysed Satellite Images}},
school = {Linköping University},
type = {{LiTH-ISY-EX--19/5254--SE}},
year = {2019},
address = {Sweden},
}
For a long time stereo cameras have been deployed in visual Simultaneous Localization And Mapping (SLAM) systems to gain 3D information. Even though stereo cameras show good performance, the main disadvantage is the complex and expensive hardware setup they require, which limits the use of the system. A simpler and cheaper alternative is monocular cameras; however, monocular images lack the important depth information. Recent works have shown that having access to depth maps in a monocular SLAM system is beneficial, since they can be used to improve the 3D reconstruction. This work proposes a deep neural network that predicts dense high-resolution depth maps from monocular RGB images by casting the problem as a supervised regression task. The network architecture follows an encoder-decoder structure in which multi-scale information is captured and skip-connections are used to recover details. The network is trained and evaluated on the KITTI dataset, achieving results comparable to state-of-the-art methods. With further development, this network shows good potential to be incorporated in a monocular SLAM system to improve the 3D reconstruction.
@mastersthesis{diva2:1347284,
author = {Larsson, Susanna},
title = {{Monocular Depth Estimation Using Deep Convolutional Neural Networks}},
school = {Linköping University},
type = {{LiTH-ISY-EX--19/5234--SE}},
year = {2019},
address = {Sweden},
}
The organization International Aid Services (IAS) provides people in East Africa with clean water through well drilling. The wells are located in surroundings far away for the investors to inspect, and therefore IAS wishes to be able to monitor their wells to get a better overview of whether different types of improvements need to be made. Seeing the load on different water sources at different times of the day and during the year, and knowing how many people are visiting the wells, is of particular interest. In this paper, a method is proposed for counting people around the wells. The goal is to choose a suitable method for detecting humans in images and evaluate how it performs. The area of counting humans in images is not a new topic, though it needs to be taken into account that the situation implies some restrictions. A Raspberry Pi with an associated camera is used, which is a small embedded system that cannot handle large and complex software. There is also a limited amount of data in the project. The method proposed in this project uses a pre-trained convolutional neural network based object detector called the Single Shot Detector, which is adapted to suit smaller devices and applications. The pre-trained network that it is based on is called MobileNet, a network developed to be used on smaller systems. To see how well the chosen detector performs, it is compared with some other models, among them a detector based on the Inception network, a significantly larger network than MobileNet. The base network is modified by transfer learning. Results show that a fine-tuned and modified network can achieve better results, from an F1-score of 0.49 for a non-fine-tuned model to 0.66 for the fine-tuned one.
@mastersthesis{diva2:1352472,
author = {Kastberg, Maria},
title = {{Using Convolutional Neural Networks to Detect People Around Wells in South Sudan}},
school = {Linköping University},
type = {{LiTH-ISY-EX--19/5200--SE}},
year = {2019},
address = {Sweden},
}
In this thesis we investigate the use of GANs for texture enhancement. To achieve this, we have studied whether synthetic satellite images generated by GANs can improve the texture in satellite-based 3D maps.
We investigate two GANs: SRGAN and pix2pix. SRGAN increases the pixel resolution of the satellite images by generating upsampled images from low-resolution images. As for pix2pix, the GAN performs image-to-image translation by translating a source image to a target image, without changing the pixel resolution.
We trained the GANs in two different approaches, named SAT-to-AER and SAT-to-AER-3D, where SAT, AER and AER-3D are different datasets provided by the company Vricon. In the first approach, aerial images were used as ground truth, and in the second approach, rendered images from an aerial-based 3D map were used as ground truth.
The procedure of enhancing the texture in a satellite-based 3D map was divided into two steps: the generation of synthetic satellite images and the re-texturing of the 3D map. Synthetic satellite images generated by two SRGAN models and one pix2pix model were used for the re-texturing. The best results were obtained using SRGAN in the SAT-to-AER approach, where the re-textured 3D map had enhanced structures and an increased perceived quality. SRGAN also presented a good result in the SAT-to-AER-3D approach, where the re-textured 3D map had a changed color distribution and the road markers were easier to distinguish from the ground. The images generated by the pix2pix model presented the worst result. As for the SAT-to-AER approach, even though the synthetic satellite images generated by pix2pix were somewhat enhanced and contained less noise, they had no significant impact on the re-texturing. In the SAT-to-AER-3D approach, none of the investigated models based on the pix2pix framework presented any successful results.
We concluded that GANs can be used as texture enhancers using both aerial images and images rendered from an aerial-based 3D map as ground truth. The use of GANs as texture enhancers has great potential, and there are several interesting areas for future work.
@mastersthesis{diva2:1375054,
author = {Birgersson, Anna and Hellgren, Klara},
title = {{Texture Enhancement in 3D Maps using Generative Adversarial Networks}},
school = {Linköping University},
type = {{LiTH-ISY-EX--19/5266--SE}},
year = {2019},
address = {Sweden},
}
Deep learning has been intensively researched in computer vision tasks like image classification. Collecting and labeling the images that these neural networks are trained on is labor-intensive, which is why alternative methods of collecting images are of interest. Virtual environments allow rendering images and automatic labeling, which could speed up the process of generating training data and reduce costs. This thesis studies the problem of transfer learning in image classification when the classifier has been trained on rendered images using a game engine and tested on real images. The goal is to render images using a game engine to create a classifier that can separate images depicting people wearing civilian clothing or camouflage. The thesis also studies how domain adaptation techniques using generative adversarial networks could be used to improve the performance of the classifier. Experiments show that it is possible to generate images that can be used for training a classifier capable of separating the two classes. However, the experiments with domain adaptation were unsuccessful. It is instead recommended to improve the quality of the rendered images in terms of the features used in the target domain to achieve better results.
@mastersthesis{diva2:1431281,
author = {Thornström, Johan},
title = {{Domain Adaptation of Unreal Images for Image Classification}},
school = {Linköping University},
type = {{LiTH-ISY-EX--20/5282--SE}},
year = {2019},
address = {Sweden},
}
Recently, the deep neural network structure caps-net was proposed by Sabour et al. [11]. Capsule networks are designed to learn relative geometry between the features of a layer and the features of the next layer. The capsule network's main building blocks are capsules, which are represented by vectors. The idea is that each capsule will represent a feature as well as traits or subfeatures of that feature. This allows for smart information routing. Capsule traits are used to predict the traits of the capsules in the next layer, and information is sent to the next-layer capsules on which the predictions agree. This is called routing by agreement. This thesis investigates theoretical support for new and existing routing algorithms, and evaluates their performance on the MNIST [16] and CIFAR-10 [8] datasets. A variation of the dynamic routing algorithm presented in the original paper [11] achieved the highest accuracy and fastest execution time.
@mastersthesis{diva2:1314210,
author = {Malmgren, Christoffer},
title = {{A Comparative Study of Routing Methods in Capsule Networks}},
school = {Linköping University},
type = {{LiTH-ISY-EX--19/5188--SE}},
year = {2019},
address = {Sweden},
}
Visual object tracking is one of the fundamental problems in computer vision, with a wide number of practical applications in e.g. robotics and surveillance. Given a video sequence and the target bounding box in the first frame, a tracker is required to find the target in all subsequent frames. It is a challenging problem due to the limited training data available. An object tracker is generally evaluated using two criteria, namely robustness and accuracy. Robustness refers to the ability of a tracker to track for long durations without losing the target. Accuracy, on the other hand, denotes how accurately a tracker can estimate the target bounding box.
Recent years have seen significant improvement in tracking robustness. However, the problem of accurate tracking has seen less attention. Most current state-of-the-art trackers resort to a naive multi-scale search strategy which has fundamental limitations. Thus, in this thesis, we aim to develop a general target estimation component which can be used to determine accurate bounding box for tracking. We will investigate how bounding box estimators used in object detection can be modified to be used for object tracking. The key difference between detection and tracking is that in object detection, the classes to which the objects belong are known. However, in tracking, no prior information is available about the tracked object, other than a single image provided in the first frame. We will thus investigate different architectures to utilize the first frame information to provide target specific bounding box predictions. We will also investigate how the bounding box predictors can be integrated into a state-of-the-art tracking method to obtain robust as well as accurate tracking.
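The overlap measure at the heart of such a target estimation component is the intersection-over-union of two boxes, sketched below for axis-aligned (x1, y1, x2, y2) boxes.

```python
# Sketch of the IoU (overlap) between two axis-aligned bounding boxes.
def iou(box_a, box_b) -> float:
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    iw = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    ih = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = iw * ih
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union > 0 else 0.0

print(iou((0, 0, 10, 10), (5, 5, 15, 15)))  # 25 / 175 ~ 0.143
```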
@mastersthesis{diva2:1291564,
author = {Bhat, Goutam},
title = {{Accurate Tracking by Overlap Maximization}},
school = {Linköping University},
type = {{LiTH-ISY-EX--19/5189--SE}},
year = {2019},
address = {Sweden},
}
In this report I summarize my master's thesis work, in which I have investigated different approaches for fusing imaging modalities for semantic segmentation with deep convolutional networks. State-of-the-art methods for semantic segmentation of RGB images use pre-trained models, which are fine-tuned to learn task-specific deep features. However, the use of pre-trained model weights constrains the model input to images with three channels (e.g. RGB images). In some applications, e.g. classification of satellite imagery, there are other imaging modalities that can complement the information from the RGB modality and, thus, improve the performance of the classification. In this thesis, semantic segmentation methods designed for RGB images are extended to handle multiple imaging modalities, without compromising the benefits that pre-training on RGB datasets offers.
In the experiments of this thesis, RGB images from satellites have been fused with the normalised difference vegetation index (NDVI) and a digital surface model (DSM). The evaluation shows that the modality fusion can significantly improve the performance of semantic segmentation networks in comparison with a corresponding network with only RGB input. However, the different investigated approaches to fuse the modalities proved to achieve similar performance. The conclusion of the experiments is that the fusion of imaging modalities is necessary, but the method of fusion is of less importance.
@mastersthesis{diva2:1182913,
author = {Sundelius, Carl},
title = {{Deep Fusion of Imaging Modalities for Semantic Segmentation of Satellite Imagery}},
school = {Linköping University},
type = {{LiTH-ISY-EX--18/5110--SE}},
year = {2018},
address = {Sweden},
}
Photos captured in the shortwave infrared (SWIR) spectrum are interesting in military applications because they are independent of the time of day at which the picture is captured: the sun, moon, stars and night glow constantly illuminate the earth with shortwave infrared radiation. A major problem with today's SWIR cameras is that they are very expensive to produce and hence not broadly available, either within the military or to civilians. A relatively new technology called compressive sensing (CS) enables a new type of camera with only a single pixel sensor (a single-pixel camera, SPC). This type of camera needs only a fraction of measurements relative to the number of pixels to be reconstructed, and reduces the cost of a shortwave infrared camera by a factor of 20. The camera uses a digital micromirror device (DMD) to select which mirrors (pixels) in the scene to measure, thus creating an underdetermined linear equation system that can be solved using the techniques described in CS to reconstruct the image. Given the new technology, it is in the Swedish Defence Research Agency's (FOI) interest to evaluate the potential of a single-pixel camera. With an SPC architecture developed by FOI, the goal of this thesis was to develop methods for sampling, reconstructing images and evaluating their quality. This thesis shows that structured random matrices and fast transforms have to be used to enable high-resolution images and to speed up the reconstruction process significantly. The evaluation of the images could be done with standard measurements associated with camera evaluation and showed that the camera can reproduce high-resolution images with relatively high image quality in daylight.
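To make the reconstruction step concrete, the following sketch recovers a sparse signal from underdetermined measurements using iterative soft-thresholding (ISTA), one of the standard CS solvers; a dense Gaussian matrix stands in for the structured random matrices and fast transforms the thesis advocates, and all sizes are made up.

    import numpy as np

    def ista(A, y, lam=0.05, n_iter=200):
        # Solves min_x 0.5*||Ax - y||^2 + lam*||x||_1 by proximal gradient steps.
        L = np.linalg.norm(A, 2) ** 2                 # Lipschitz constant of the data-term gradient
        x = np.zeros(A.shape[1])
        for _ in range(n_iter):
            g = x - A.T @ (A @ x - y) / L             # gradient step on the data term
            x = np.sign(g) * np.maximum(np.abs(g) - lam / L, 0.0)  # soft threshold (prox of L1)
        return x

    rng = np.random.default_rng(0)
    n, m, k = 256, 64, 8                              # signal length, measurements, sparsity
    x_true = np.zeros(n)
    x_true[rng.choice(n, k, replace=False)] = rng.standard_normal(k)
    A = rng.standard_normal((m, n)) / np.sqrt(m)      # stand-in for a structured measurement matrix
    y = A @ x_true                                    # one scalar measurement per DMD pattern
    print(np.linalg.norm(ista(A, y) - x_true) / np.linalg.norm(x_true))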
@mastersthesis{diva2:1185507,
author = {Brorsson, Andreas},
title = {{Compressive Sensing: Single Pixel SWIR Imaging of Natural Scenes}},
school = {Linköping University},
type = {{LiTH-ISY-EX--18/5108--SE}},
year = {2018},
address = {Sweden},
}
Industrial applications of computer vision often utilize traditional image processing techniques, whereas state-of-the-art methods in most image processing challenges are almost exclusively based on convolutional neural networks (CNNs). Thus, there is large potential for improving the performance of many machine vision applications by incorporating CNNs.
One such application is the classification of juice boxes with straws, where the baseline solution uses classical image processing techniques on depth images to reject or accept juice boxes. This thesis aims to investigate how CNNs perform on the task of semantic segmentation (pixel-wise classification) of said images, and whether the result can be used to increase classification performance.
A drawback of CNNs is that they usually require large amounts of labelled data for training to be able to generalize and learn anything useful. As labelled data is hard to come by, two ways to get cheap data are investigated, one being synthetic data generation and the other being automatic labelling using the baseline solution.
The implemented network performs well on semantic segmentation, even when trained on synthetic data only, though the performance increases with the ratio of real (automatically labelled) to synthetic images. The classification task is very sensitive to small errors in semantic segmentation, and the results are therefore not as good as the baseline solution. It is suspected that the drop in performance between validation and test data is due to a domain shift between the data sets, e.g. variations in data collection and straw and box type; fine-tuning to the target domain could increase performance.
When trained on synthetic data the domain shift is even larger and the performance on classification is next to useless. It is likely that the results could be improved by using more advanced data generation, e.g. a generative adversarial network (GAN), or more rigorous modelling of the data.
@mastersthesis{diva2:1189501,
author = {Carlsson, Mattias},
title = {{Neural Networks for Semantic Segmentation in the Food Packaging Industry}},
school = {Linköping University},
type = {{LiTH-ISY-EX--18/5113--SE}},
year = {2018},
address = {Sweden},
}
Deep learning has been growing rapidly in recent years, obtaining excellent results for many computer vision applications, such as image classification and object detection. One reason for the increased popularity of deep learning is that it mitigates the need for hand-crafted features. This thesis work investigates deep learning as a methodology to solve the problem of autonomous collision avoidance for a small robotic car. To accomplish this, transfer learning is used with the VGG16 deep network pre-trained on the ImageNet dataset. A dataset has been collected and then used to fine-tune and validate the network offline. The deep network has been used with the robotic car in real time: the robotic car sends images to an external computer, which runs the network, and the predictions are sent back to the robotic car, which takes actions based on them. The results show that deep learning has great potential in solving the collision avoidance problem.
@mastersthesis{diva2:1204063,
author = {Strömgren, Oliver},
title = {{Deep Learning for Autonomous Collision Avoidance}},
school = {Linköping University},
type = {{LiTH-ISY-EX--18/5115--SE}},
year = {2018},
address = {Sweden},
}
The aim of this work is to find a method for removing haze from satellite imagery. This is done by taking two algorithms developed for images taken from the surface of the earth and adapting them for satellite images. The two algorithms are Single Image Haze Removal Using Dark Channel Prior by He et al. and Color Image Dehazing Using the Near-Infrared by Schaul et al. Both algorithms, altered to fit satellite images, as well as their combination, are applied to four sets of satellite images. The results are compared with each other and with the unaltered images. The evaluation is both qualitative, i.e. looking at the images, and quantitative, using three properties: colorfulness, contrast and saturated pixels. Both the qualitative and the quantitative evaluation determined that using only the altered version of Dark Channel Prior gives the result with the least amount of haze and whose colors look most like reality.
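For reference, a minimal NumPy/SciPy sketch of the core of He et al.'s dark channel prior method is given below; the patch size and constants follow the original paper, while the input is a random stand-in image and the satellite-specific alterations of the thesis are not included.

    import numpy as np
    from scipy.ndimage import minimum_filter

    def dark_channel(img, patch=15):
        # Per-pixel minimum over the color channels, then a local minimum filter.
        return minimum_filter(img.min(axis=2), size=patch)

    def dehaze(img, omega=0.95, t0=0.1, patch=15):
        dark = dark_channel(img, patch)
        # Atmospheric light A: mean color of the brightest 0.1% dark-channel pixels.
        flat = np.argsort(dark, axis=None)[-max(1, dark.size // 1000):]
        A = img[np.unravel_index(flat, dark.shape)].mean(axis=0)
        t = 1.0 - omega * dark_channel(img / A, patch)   # transmission estimate
        t = np.clip(t, t0, 1.0)[..., None]
        return (img - A) / t + A                          # recovered scene radiance

    img = np.random.rand(64, 64, 3)  # stand-in for a normalized satellite image
    print(dehaze(img).shape)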
@mastersthesis{diva2:1215181,
author = {Hultberg, Johanna},
title = {{Dehazing of Satellite Images}},
school = {Linköping University},
type = {{LiTH-ISY-EX--18/5121--SE}},
year = {2018},
address = {Sweden},
}
3D reconstruction is the process of constructing a three-dimensional model from images. It contains multiple steps where each step can induce errors. When doing 3D reconstruction of outdoor scenes, there are some types of scene content that regularly cause problems and affect the resulting 3D model. Two of these are water, due to its fluctuating nature, and sky, because it contains no useful (3D) data. These areas cause different problems throughout the process and generally do not benefit it in any way. Therefore, masking them early in the reconstruction chain could be a useful step in an outdoor scene reconstruction pipeline. Manual masking of images is a time-consuming task that becomes very tedious for the big data sets often used in large-scale 3D reconstructions. This master thesis explores whether this can be done automatically using Convolutional Neural Networks for semantic segmentation, and to what degree the masking would benefit a 3D reconstruction pipeline.
@mastersthesis{diva2:1216761,
author = {Kernell, Björn},
title = {{Improving Photogrammetry using Semantic Segmentation}},
school = {Linköping University},
type = {{LiTH-ISY-EX--18/5118--SE}},
year = {2018},
address = {Sweden},
}
In this thesis we study a perception problem in the context of autonomous driving. Specifically, we study the computer vision problem of 3D object detection, in which objects should be detected from various sensor data and their position in the 3D world should be estimated. We also study the application of Generative Adversarial Networks in domain adaptation techniques, aiming to improve the 3D object detection model's ability to transfer between different domains.
The state-of-the-art Frustum-PointNet architecture for LiDAR-based 3D object detection was implemented and found to closely match its reported performance when trained and evaluated on the KITTI dataset. The architecture was also found to transfer reasonably well from the synthetic SYN dataset to KITTI, and is thus believed to be usable in a semi-automatic 3D bounding box annotation process. The Frustum-PointNet architecture was also extended to explicitly utilize image features, which surprisingly degraded its detection performance. Furthermore, an image-only 3D object detection model was designed and implemented, which was found to compare quite favourably with current state-of-the-art in terms of detection performance.
Additionally, the PixelDA approach was adopted and successfully applied to the MNIST to MNIST-M domain adaptation problem, which validated the idea that unsupervised domain adaptation using Generative Adversarial Networks can improve the performance of a task network for a dataset lacking ground truth annotations. Surprisingly, the approach did however not significantly improve upon the performance of the image-based 3D object detection models when trained on the SYN dataset and evaluated on KITTI.
@mastersthesis{diva2:1218149,
author = {Gustafsson, Fredrik and Linder-Nor\'{e}n, Erik},
title = {{Automotive 3D Object Detection Without Target Domain Annotations}},
school = {Linköping University},
type = {{LiTH-ISY-EX--18/5138--SE}},
year = {2018},
address = {Sweden},
}
The cost and environmental damage of reclaims is a large problem within the paper industry. With certain types of paper, so-called crepe marks on the paper's surface are a common issue, leading to printing defects and consequently to reclaims. This thesis compares four different image analysis methods for evaluating crepe marks and predicting printing results. The methods evaluated consist of one established method, two adaptations of established methods and one novel method. All methods were evaluated on the same data: topographic height images of paper samples from 4 paper rolls of similar type but differing in roughness. The method based on 1D Fourier analysis and the method based on fully convolutional networks perform best, depending on whether speed or detailed characteristics is the priority.
@mastersthesis{diva2:1219118,
author = {Strömberg, Isak},
title = {{Characterization of creping marks in paper}},
school = {Linköping University},
type = {{LiTH-ISY-EX--18/5151--SE}},
year = {2018},
address = {Sweden},
}
When a Time-of-Flight (ToF) depth camera is used to monitor a region of interest, it has to be mounted correctly and have information regarding its position. Manual configuration currently requires managing captured 3D ToF data in a 2D environment, which limits the user and might give rise to errors due to misinterpretation of the data. This thesis investigates whether a real-time 3D reconstruction mesh from a Microsoft HoloLens can be used as a target for point cloud registration using the ToF data, thus configuring the camera autonomously. Three registration algorithms, Fast Global Registration (FGR), Joint Registration of Multiple Point Clouds (JR-MPC) and Prerejective RANSAC, were evaluated for this purpose.
It was concluded that accurate registration is possible despite the use of different sensors. It was also shown that the registration can be done within a reasonable time, compared with the inherent time needed to perform 3D reconstruction on the HoloLens. All algorithms could solve the problem, but FGR provided the most satisfying results, though it requires several constraints on the data.
@mastersthesis{diva2:1222450,
author = {Kjell\'{e}n, Kevin},
title = {{Point Cloud Registration in Augmented Reality using the Microsoft HoloLens}},
school = {Linköping University},
type = {{LiTH-ISY-EX--18/5160--SE}},
year = {2018},
address = {Sweden},
}
Volume measurement of timber loads is done in conjunction with timber trade. When dealing with goods of such major economic value, it is important to achieve an impartial and fair assessment when determining price-based volumes.
With the help of Saab's missile targeting technology, CIND AB develops products for digital volume measurement of timber loads. Currently there is a system in operation that automatically reconstructs timber trucks in motion to create measurable images of them. Future iterations of the system are expected to fully automate the scaling by generating a volumetric representation of the timber and calculating its external gross volume. The first challenge in this development is to separate the timber load from the truck.
This thesis aims to evaluate and implement an appropriate method for semantic pixel-wise segmentation of timber loads in real time. Image segmentation is a classic but difficult problem in computer vision. To achieve greater robustness, it is therefore important to carefully study and make use of the conditions given by the existing system. Variations in timber type, truck type and packing together create unique combinations that the system must be able to handle. The system must work around the clock in different weather conditions while maintaining high precision and performance.
@mastersthesis{diva2:1222024,
author = {Sällqvist, Jessica},
title = {{Real-time 3D Semantic Segmentation of Timber Loads with Convolutional Neural Networks}},
school = {Linköping University},
type = {{LiTH-ISY-EX--18/5131--SE}},
year = {2018},
address = {Sweden},
}
Data about the earth is increasing in value and demand from customers, but it is difficult to produce accurately and cheaply. This thesis examines whether it is possible to take low-resolution and distorted 3D data and increase the accuracy of building geometry by performing building reconstruction. Building reconstruction is performed with a Markov chain Monte Carlo method where building primitives are placed iteratively until a good fit is found. The digital height model and pixel classification used are produced by Vricon. The method is able to correctly place primitive models, but often overestimates their dimensions by about 15%.
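A minimal sketch of the iterative primitive placement is given below: a Metropolis-style loop proposes perturbed box primitives and accepts them based on how well they fit a height model. The energy function, box parametrization and temperature are illustrative assumptions, not Vricon's or the thesis' actual model.

    import numpy as np

    rng = np.random.default_rng(1)

    def energy(p, dsm):
        # Hypothetical fit score: mean squared error between a flat-roofed box
        # (x, y, width, depth, height) and the digital surface model (DSM).
        x = int(np.clip(p[0], 0, dsm.shape[1] - 2))
        y = int(np.clip(p[1], 0, dsm.shape[0] - 2))
        w = int(np.clip(p[2], 1, dsm.shape[1] - x))
        d = int(np.clip(p[3], 1, dsm.shape[0] - y))
        model = np.zeros_like(dsm)
        model[y:y + d, x:x + w] = p[4]
        return np.mean((model - dsm) ** 2)

    dsm = np.zeros((50, 50)); dsm[10:30, 15:35] = 8.0   # synthetic height model
    params = np.array([5.0, 5.0, 10.0, 10.0, 5.0])      # initial box guess
    E = energy(params, dsm)
    for _ in range(5000):
        prop = params + rng.normal(0.0, 1.0, 5)         # random perturbation proposal
        E_prop = energy(prop, dsm)
        if E_prop < E or rng.random() < np.exp(-(E_prop - E) / 0.5):  # Metropolis acceptance
            params, E = prop, E_prop
    print(params.round(1), round(E, 3))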
@mastersthesis{diva2:1223969,
author = {Nilsson, Mats},
title = {{Building Reconstruction of Digital Height Models with the Markov Chain Monte Carlo Method}},
school = {Linköping University},
type = {{LiTH-ISY-EX--18/5130--SE}},
year = {2018},
address = {Sweden},
}
Robotic bin picking is the problem of emptying a bin of randomly distributed objects through a robotic interface. This thesis examines an SVM approach to extract grasping points for a vacuum-type gripper. The SVM is trained on synthetic data and used to classify the points of a non-synthetic 3D-scanned point cloud as either graspable or non-graspable. The classified points are then clustered into graspable regions from which the grasping points are extracted.
The SVM models and the algorithm as a whole are trained and evaluated against cubic and cylindrical objects. Separate SVM models are trained for each type of object, in addition to one model trained on a dataset containing both types of objects. It is shown that the performance of the SVM in terms of accuracy is dependent on the objects and their geometrical properties. Further, it is shown that the algorithm is reasonably robust in terms of successfully picking objects, regardless of the scale of the objects.
@mastersthesis{diva2:1243310,
author = {Olsson, Fredrik},
title = {{Feature Based Learning for Point Cloud Labeling and Grasp Point Detection}},
school = {Linköping University},
type = {{LiTH-ISY-EX--18/5165--SE}},
year = {2018},
address = {Sweden},
}
The purpose of the thesis was to investigate the possibility of using machine learning for automation of liver fat measurements in fat-water magnetic resonance imaging (MRI). The thesis presents methods for texture-based liver classification and Proton Density Fat Fraction (PDFF) regression using multi-layer perceptrons utilizing 2D and 3D textural image features. The first proposed method was a data classification method with the goal of distinguishing between suitable and unsuitable regions to measure PDFF in. The second proposed method was a combined classification and regression method where the classification distinguishes between liver and non-liver tissue. The goal of the regression model was to predict the difference d = PDFF_mean − PDFF_ROI between the manual ground-truth mean and the fat fraction of the active Region of Interest (ROI). Tests were performed on varying sizes of Image Feature Regions (froi) and combinations of image features for both of the proposed methods. The tests showed that 3D measurements using image features from discrete wavelet transforms produced measurements similar to the manual fat measurements. The first method resulted in lower relative errors, while the second method had higher method agreement compared to manual measurements.
@mastersthesis{diva2:1248500,
author = {Grundström, Tobias},
title = {{Automated Measurements of Liver Fat Using Machine Learning}},
school = {Linköping University},
type = {{LiTH-ISY-EX--18/5166--SE}},
year = {2018},
address = {Sweden},
}
Recently, sensors such as radars and cameras have been widely used in automotive applications, especially in Advanced Driver-Assistance Systems (ADAS), to collect information about the vehicle's surroundings. Stereo cameras are very popular as they can be used passively to construct a 3D representation of the scene in front of the car. This has allowed the development of several ADAS algorithms that need 3D information to perform their tasks. One interesting application is Road Surface Preview (RSP), where the task is to estimate the road height along the future path of the vehicle. An active suspension control unit can then use this information to regulate the suspension, improving driving comfort, extending the durability of the vehicle and warning the driver about potential risks on the road surface. Stereo cameras have been successfully used in RSP and have demonstrated very good performance. However, the main disadvantages of stereo cameras are their high production cost and high power consumption, which limits installing several ADAS features in economy-class vehicles. A less expensive alternative is monocular cameras, which have a significantly lower cost and power consumption. Therefore, this thesis investigates the possibility of solving the Road Surface Preview task using a monocular camera. We try two different approaches: structure-from-motion and Convolutional Neural Networks. The proposed methods are evaluated against the stereo-based system. Experiments show that both structure-from-motion and CNNs have good potential for solving the problem, but they are not yet reliable enough to be a complete solution to the RSP task and be used in an active suspension control unit.
@mastersthesis{diva2:1253882,
author = {Ekström, Marcus},
title = {{Road Surface Preview Estimation Using a Monocular Camera}},
school = {Linköping University},
type = {{LiTH-ISY-EX--18/5173--SE}},
year = {2018},
address = {Sweden},
}
Thermal spectrum cameras are gaining interest in many applications due to their long wavelength, which allows them to operate under low light and harsh weather conditions. One disadvantage of thermal cameras is their limited visual interpretability for humans, which limits the scope of their applications. In this thesis, we try to address this problem by investigating the possibility of transforming thermal infrared (TIR) images to perceptually realistic visible spectrum (VIS) images using Convolutional Neural Networks (CNNs). Existing state-of-the-art colorization CNNs fail to provide the desired output, as they were trained to map grayscale VIS images to color VIS images. Instead, we utilize an auto-encoder architecture to perform cross-spectral transformation between TIR and VIS images. This architecture was shown to perform very well quantitatively while producing perceptually realistic images. We show that the quantitative differences are insignificant when training this architecture using different color spaces, while there are clear qualitative differences depending on the choice of color space. Finally, we found that a CNN trained on daytime examples generalizes well to night-time test images.
@mastersthesis{diva2:1255342,
author = {Nyberg, Adam},
title = {{Transforming Thermal Images to Visible Spectrum Images Using Deep Learning}},
school = {Linköping University},
type = {{LiTH-ISY-EX--18/5167--SE}},
year = {2018},
address = {Sweden},
}
This master thesis explores the possibility of using Generative Adversarial Networks (GANs) to refine labeled synthetic code images so that they resemble real code images while preserving label information. The GAN used in this thesis consists of a refiner and a discriminator. The discriminator tries to distinguish between real images and refined synthetic images. The refiner tries to fool the discriminator by producing refined synthetic images such that the discriminator classifies them as real. By updating these two networks iteratively, the idea is that they will push each other to get better, resulting in refined synthetic images with real-image characteristics.
The aspiration, if the exploration of GANs turns out successful, is to be able to use refined synthetic images as training data in Semantic Segmentation (SS) tasks and thereby eliminate the laborious task of gathering and labeling real data. Starting off from a foundational GAN-model, different network architectures, hyperparameters and other design choices are explored to find the best performing GAN-model.
As is widely acknowledged in the relevant literature, GANs can be difficult to train, and the results in this thesis are varying and sometimes ambiguous. Based on the results from this study, the best-performing models do however perform better in SS tasks, with regard to Intersection over Union, than the unrefined synthetic set they are based on and benchmarked against.
@mastersthesis{diva2:1254973,
author = {Stenhagen, Petter},
title = {{Improving Realism in Synthetic Barcode Images using Generative Adversarial Networks}},
school = {Linköping University},
type = {{LiTH-ISY-EX--18/5169--SE}},
year = {2018},
address = {Sweden},
}
In recent years, development of Convolutional Neural Networks has enabled high performing semantic segmentation models. Generally, these deep learning based segmentation methods require a large amount of annotated data. Acquiring such annotated data for semantic segmentation is a tedious and expensive task.
Within machine learning, active learning involves the selection of new data in order to limit the usage of annotated data. In active learning, the model is trained for several iterations, and additional samples that the model is uncertain of are selected. The model is then retrained on these additional samples and the process is repeated. In this thesis, an active learning framework has been applied to road segmentation, which is semantic segmentation of objects related to road scenes.
The uncertainty in the samples is estimated with Monte Carlo dropout. In Monte Carlo dropout, several dropout masks are applied to the model and the variance across the resulting predictions is captured, working as an estimate of the model's uncertainty (a minimal sketch is given after this abstract). Other metrics to rank the uncertainty evaluated in this work are: a baseline method that selects samples randomly, the entropy of the default predictions, and three variations/extensions of Monte Carlo dropout.
Both the active learning framework and uncertainty estimation are implemented in the thesis. Monte Carlo dropout performs slightly better than the baseline in 3 out of 4 metrics. Entropy outperforms all other implemented methods in all metrics. The three additional methods do not perform better than Monte Carlo dropout.
An analysis of what kind of uncertainty Monte Carlo dropout captures is performed, together with a comparison of the samples selected by the baseline and by Monte Carlo dropout. Future development and possible improvements are also discussed.
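As referenced above, a minimal PyTorch sketch of Monte Carlo dropout follows: dropout is kept active at test time, several stochastic forward passes are made, and the per-pixel variance serves as the uncertainty estimate. The toy two-class network and all sizes are made up for the example.

    import torch
    import torch.nn as nn

    # Toy segmentation-style head; Dropout2d stays stochastic while sampling.
    model = nn.Sequential(nn.Conv2d(3, 8, 3, padding=1), nn.ReLU(),
                          nn.Dropout2d(0.5), nn.Conv2d(8, 2, 1))

    def mc_dropout_predict(model, x, T=20):
        model.train()                        # keep dropout masks random at test time
        with torch.no_grad():
            probs = torch.stack([model(x).softmax(dim=1) for _ in range(T)])
        return probs.mean(0), probs.var(0)   # predictive mean and variance

    x = torch.randn(1, 3, 32, 32)
    mean, var = mc_dropout_predict(model, x)
    print(mean.shape, float(var.mean()))     # the variance ranks sample uncertainty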
@mastersthesis{diva2:1259079,
author = {Sörsäter, Michael},
title = {{Active Learning for Road Segmentation using Convolutional Neural Networks}},
school = {Linköping University},
type = {{LiTH-ISY-EX--18/5176--SE}},
year = {2018},
address = {Sweden},
}
Training data is the bottleneck for training Convolutional Neural Networks. A larger dataset gives better accuracy but also requires longer training time. It is shown that fine-tuning neural networks on synthetically rendered images increases the mean average precision. This method was applied to two different datasets with five distinctive objects in each. The first dataset consisted of random objects with different geometric shapes. The second dataset contained objects used to assemble IKEA furniture. The neural network with the best performance, trained on 5400 images, achieved a mean average precision of 0.81 on a test set sampled from a video sequence. The impact of dataset size, batch size, number of training epochs and different network architectures was analyzed. Using synthetic images to train CNNs is a promising path for object detection when access to large amounts of annotated image data is hard to come by.
@mastersthesis{diva2:1267446,
author = {Vi, Margareta},
title = {{Object Detection Using Convolutional Neural Network Trained on Synthetic Images}},
school = {Linköping University},
type = {{LiTH-ISY-EX--18/5180--SE}},
year = {2018},
address = {Sweden},
}
Visual tracking is a computer vision problem where the task is to follow a target through a video sequence. Tracking has many important real-world applications in several fields such as autonomous vehicles and robot vision. Since visual tracking does not assume any prior knowledge about the target, it faces different challenges such as occlusion, appearance change, background clutter and scale change. In this thesis we try to improve the capabilities of tracking frameworks using discriminative correlation filters by incorporating scene depth information. We utilize scene depth information on three main levels. First, we use raw depth information to segment the target from its surroundings, enabling occlusion detection and scale estimation. Second, we investigate different visual features calculated from depth data to decide which features are good at encoding geometric information available solely in depth data. Third, we investigate handling missing data in the depth maps using a modified version of the normalized convolution framework. Finally, we introduce a novel approach for parameter search using genetic algorithms to find the best hyperparameters for our tracking framework. Experiments show that depth data can be used to estimate scale changes and handle occlusions. In addition, visual features calculated from depth are more representative if combined with color features. It is also shown that utilizing normalized convolution improves the overall performance in some cases. Lastly, the usage of genetic algorithms for hyperparameter search leads to accuracy gains as well as some insights into the performance of different components within the framework.
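For background, the sketch below shows the basic single-channel discriminative correlation filter that such frameworks build on (a MOSSE-style closed-form filter in the Fourier domain); it is a textbook baseline, not the depth-augmented tracker of the thesis, and the patch data is synthetic.

    import numpy as np

    def train_dcf(patch, sigma=2.0, lam=1e-2):
        # Closed-form correlation filter: the desired response is a centered Gaussian.
        h, w = patch.shape
        yy, xx = np.mgrid[0:h, 0:w]
        g = np.exp(-((yy - h // 2) ** 2 + (xx - w // 2) ** 2) / (2 * sigma ** 2))
        G = np.fft.fft2(np.fft.ifftshift(g))            # target response spectrum
        F = np.fft.fft2(patch)
        return G * np.conj(F) / (F * np.conj(F) + lam)  # regularized filter

    def detect(H, patch):
        # Correlation response; the argmax gives the estimated target shift.
        resp = np.real(np.fft.ifft2(np.fft.fft2(patch) * H))
        return np.unravel_index(resp.argmax(), resp.shape)

    rng = np.random.default_rng(0)
    frame0 = rng.standard_normal((64, 64))
    H = train_dcf(frame0)
    frame1 = np.roll(frame0, (3, 5), axis=(0, 1))   # target shifted by (3, 5) pixels
    print(detect(H, frame1))                         # peak appears at (3, 5)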
@mastersthesis{diva2:1266346,
author = {Stynsberg, John},
title = {{Incorporating Scene Depth in Discriminative Correlation Filters for Visual Tracking}},
school = {Linköping University},
type = {{LiTH-ISY-EX--18/5178--SE}},
year = {2018},
address = {Sweden},
}
In many situations after a big catastrophe, such as the one in Fukushima, the disaster area is highly dangerous for humans to enter. It is in such environments that a semi-autonomous robot could limit the risks to humans by exploring and mapping the area on its own. This thesis intends to design and implement a software-based SLAM system with the potential to run in real time, using a Kinect 2 sensor as input.
The focus of the thesis has been to create a system which allows for efficient storage and representation of the map, in order to be able to explore large environments. This is done by separating the map in different abstraction levels corresponding to local maps connected by a global map.
During the implementation, this structure has been kept in mind in order to allow modularity. This makes it possible for each sub-component in the system to be exchanged if needed.
The thesis is broad in the sense that it uses techniques from distinct areas to solve the sub-problems that exist, for example object detection and classification, point-cloud registration and efficient 3D-based occupancy trees.
@mastersthesis{diva2:1065996,
author = {Holmquist, Karl},
title = {{SLAMIt A Sub-Map Based SLAM System:
On-line creation of multi-leveled map}},
school = {Linköping University},
type = {{LiTH-ISY-EX--16/5021--SE}},
year = {2017},
address = {Sweden},
}
After a digital photo has been taken by a camera, it can be manipulated to be more appealing. Two ways of doing so are to reduce noise and to increase the saturation. With time and skill in an image manipulation program, this is usually done by hand. In this thesis, automatic image improvement based on artificial neural networks is explored and evaluated qualitatively and quantitatively. A new approach, which builds on an existing method for colorizing grayscale images, is presented and its performance compared both to simpler methods and to the state of the art in image denoising. Saturation is lowered and noise added to original images, which the methods receive as inputs to improve upon. The new method is shown to improve the images in some cases but not all, depending on the image and how it was modified before being given to the method.
@mastersthesis{diva2:1098332,
author = {Lind, Benjamin},
title = {{Artificial Neural Networks for Image Improvement}},
school = {Linköping University},
type = {{LiTH-ISY-EX--17/5025--SE}},
year = {2017},
address = {Sweden},
}
This thesis investigates if support vector machine classification is a suitable approach when performing automatic segmentation of knee cartilage using quantitative magnetic resonance imaging data. The data sets used are part of a clinical project that investigates if patients that have suffered recent knee damage will develop cartilage damage. Therefore the thesis also investigates if the segmentation results can be used to predict the clinical outcome of the patients.
Two methods that perform the segmentation using support vector machine classification are implemented and evaluated. The evaluation indicates that it is a good approach for the task, but the implemented methods need to be further improved and tested on more data sets before clinical use.
It was not possible to relate the cartilage properties to clinical outcome using the segmentation results. However, the investigation demonstrated good promise of how the segmentation results, if they are improved, can be used in combination with quantitative magnetic resonance imaging data to analyze how the cartilage properties change over time or vary between knees.
@mastersthesis{diva2:1109911,
author = {Lind, Marcus},
title = {{Automatic Segmentation of Knee Cartilage Using Quantitative MRI Data}},
school = {Linköping University},
type = {{LiTH-ISY-EX--17/5041--SE}},
year = {2017},
address = {Sweden},
}
Automated navigability assessment based on image sensor data is an important concern in the design of autonomous robotic systems. The problem consists in finding a mapping from input data to the navigability status of different areas of the surrounding world. Machine learning techniques are often applied to this problem. This thesis investigates an approach to navigability assessment in the image plane, based on offline learning using deep convolutional neural networks, applied to RGB and depth data collected using a robotic platform. Training outputs were generated by manually marking out instances of near collision in the sequences and tracing the location of the near-collision frame back through the previous frames. Several combinations of network inputs were tried out, including grayscale gradient versions of the RGB frames, depth maps, image coordinate maps and motion information in the form of a previous RGB frame or heading maps. Some improvement compared to simple depth thresholding was demonstrated, mainly in the handling of noise and missing pixels in the depth maps. The resulting networks appear to be mostly dependent on depth information; an attempt to train a network without the depth frames was unsuccessful, and a network trained using the depth frames alone performed similarly to networks trained with additional inputs. An unsuccessful attempt at training a network towards a more motion-dependent navigability concept was also made. It was done by including training frames captured as the robot was moving away from the obstacle, where the corresponding training outputs were marked as obstacle-free.
@mastersthesis{diva2:1110839,
author = {Wimby Schmidt, Ebba},
title = {{Navigability Assessment for Autonomous Systems Using Deep Neural Networks}},
school = {Linköping University},
type = {{LiTH-ISY-EX--17/5045--SE}},
year = {2017},
address = {Sweden},
}
There is a need for quantitative CT data in radiation therapy. Currently there are only a few algorithms that address this issue, for instance the commercial DirectDensity algorithm. In scientific literature, an example of such an algorithm is DIRA. DIRA is an iterative model-based reconstruction method for dual-energy CT whose goal is to determine the material composition of the patient from accurate linear attenuation coefficients. It had been implemented in a two-dimensional geometry, i.e., it could process axial scans only. There was a need to extend DIRA so that it could process projection data generated in helical scanning geometries. The newly developed algorithm (DIRA-3D) implemented (i) polyenergetic semi-parallel projection generation, (ii) mono-energetic parallel projection generation and (iii) the PI-method for image reconstruction. The computation experiments showed that the accuracies of the resulting LACs and mass fractions were comparable to those of the original DIRA. The results converged after 10 iterations.
@mastersthesis{diva2:1111894,
author = {Björnfot, Magnus},
title = {{Extension of DIRA (Dual-Energy Iterative Algorithm) to 3D Helical CT}},
school = {Linköping University},
type = {{LiTH-ISY-EX--17/5057--SE}},
year = {2017},
address = {Sweden},
}
Visual Object Tracking is the computer vision problem of estimating a target trajectory in a video given only its initial state. A visual tracker often acts as a component in the intelligent vision systems seen in for instance surveillance, autonomous vehicles or robots, and unmanned aerial vehicles. Applications may require robust tracking performance on difficult sequences depicting targets undergoing large changes in appearance, while enforcing a real-time constraint. Discriminative correlation filters have shown promising tracking performance in recent years, and consistently improved the state-of-the-art. With the advent of deep learning, new robust deep features have improved tracking performance considerably. However, methods based on discriminative correlation filters learn a rigid template describing the target appearance. This implies an assumption of target rigidity which is not fulfilled in practice. This thesis introduces an approach which integrates deformability into a state-of-the-art tracker. The approach is thoroughly tested on three challenging visual tracking benchmarks, achieving state-of-the-art performance.
@mastersthesis{diva2:1111930,
author = {Johnander, Joakim},
title = {{Visual Tracking with Deformable Continuous Convolution Operators}},
school = {Linköping University},
type = {{LiTH-ISY-EX--17/5047--SE}},
year = {2017},
address = {Sweden},
}
Modern cars are often equipped with sensors, like radar, infrared cameras and stereo cameras, that collect information about their surroundings. By using a stereo camera, it is possible to obtain information about the distance to points in front of the car. This information can be used to estimate the height of the predicted path of the car. An application which does this is the stereo-based Road Surface Preview (RSP) algorithm. By using the output from the RSP algorithm, it is possible to use active suspension control, which controls the vertical movement of the wheels relative to the chassis. This application primarily makes the driving experience more comfortable, but also extends the durability of the vehicle. The idea behind this Master's thesis is to create an evaluation tool for the RSP algorithm which can be used on arbitrary roads.
The thesis describes the proposed evaluation tool, where the focus has been on making an accurate comparison between camera data received from the RSP algorithm and laser data used as ground truth. Since the tool shall be used at the company proposing this thesis, focus has also been on making the tool user friendly. The report discusses the proposed methods, possible sources of error and improvements. The evaluation tool shows good results for the available test data, which made it possible to include an investigation of a possible improvement of the RSP algorithm.
@mastersthesis{diva2:1115333,
author = {Manfredsson, Johan},
title = {{Evaluation Tool for a Road Surface Algorithm}},
school = {Linköping University},
type = {{LiTH-ISY-EX--17/5063--SE}},
year = {2017},
address = {Sweden},
}
Image registration is the process of geometrically deforming a template image into a reference image. This technique is important and widely used within the field of medical IT. The purpose could be to detect image variations, pathological development or, in the company AMRA's case, to quantify fat tissue in various parts of the human body. From an MRI (Magnetic Resonance Imaging) scan, a water and a fat tissue image are obtained. Currently, AMRA is using the Morphon algorithm to register and segment the water image in order to quantify fat and muscle tissue. During the first part of this master thesis, two alternative registration methods were evaluated. The first algorithm was Free Form Deformation, which is a non-linear parametric-based method. The second algorithm was a non-parametric optical flow based method known as the Demon algorithm. During the second part of the thesis, the Demon algorithm was used to evaluate the effect of using the fat images for registration.
@mastersthesis{diva2:1118172,
author = {Ivarsson, Magnus},
title = {{Evaluation of 3D MRI Image Registration Methods}},
school = {Linköping University},
type = {{LiTH-ISY-EX--17/5037--SE}},
year = {2017},
address = {Sweden},
}
All dairy cows in Europe wear unique identification tags in their ears. These ear tags are standardized and contain the cow's identification number, today only used for visual identification by the farmer. The cow also needs to be identified by an automatic identification system connected to milk machines and other robotics used at the farm. Currently this is solved with a non-standardized radio transmitter which can be placed on different places on the cow, and different receivers need to be used on different farms. Other drawbacks of the currently used identification system are that it is expensive and unreliable. This thesis explores the possibility of replacing this non-standardized radio frequency based identification system with a standardized computer vision based system. The method proposed in this thesis uses a color threshold approach for detection, a flood fill approach followed by the Hough transform and a projection method for segmentation, and evaluates template matching, k-nearest neighbour and support vector machines as optical character recognition methods. The results show that the quality of the data used as input to the system is vital. Using good data, k-nearest neighbour, which showed the best results of the three OCR approaches, correctly handles 98% of the digits.
@mastersthesis{diva2:1120668,
author = {Ilestrand, Maja},
title = {{Automatic Eartag Recognition on Dairy Cows in Real Barn Environment}},
school = {Linköping University},
type = {{LiTH-ISY-EX--17/5072--SE}},
year = {2017},
address = {Sweden},
}
The two main bottlenecks when using deep neural networks are data dependency and training time. This thesis proposes a novel method for weight initialization of the convolutional layers in a convolutional neural network, introducing the usage of sparse dictionaries. A sparse dictionary optimized on domain-specific data can be seen as a set of intelligent feature-extracting filters. This thesis investigates the effect of using such filters as kernels in the convolutional layers of the neural network: how do they affect the training time and final performance?
The dataset used here is the Cityscapes dataset, which is a library of 25000 labeled road-scene images. The sparse dictionary was acquired using the K-SVD method. The filters were added to two different networks whose performance was tested individually; one of the architectures is much deeper than the other. The results, presented for both networks, show that filter initialization is an important aspect which should be taken into consideration when training deep networks for semantic segmentation.
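A minimal sketch of dictionary-based filter initialization is given below. Since K-SVD is not available in the standard Python libraries, scikit-learn's online dictionary learning stands in for it; the patch size, atom count and random stand-in image are illustrative assumptions.

    import numpy as np
    from sklearn.decomposition import MiniBatchDictionaryLearning
    from sklearn.feature_extraction.image import extract_patches_2d

    rng = np.random.default_rng(0)
    image = rng.random((128, 128))                    # stand-in for a road-scene image
    patches = extract_patches_2d(image, (7, 7), max_patches=2000, random_state=0)
    X = patches.reshape(len(patches), -1)
    X -= X.mean(axis=1, keepdims=True)                # zero-mean patches

    # Learn 32 sparse atoms on domain data (the thesis uses K-SVD instead).
    dico = MiniBatchDictionaryLearning(n_components=32, alpha=1.0, random_state=0)
    dico.fit(X)
    kernels = dico.components_.reshape(32, 7, 7)      # initial kernels for a conv layer
    print(kernels.shape)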
@mastersthesis{diva2:1127291,
author = {Andersson, Viktor},
title = {{Semantic Segmentation:
Using Convolutional Neural Networks and Sparse dictionaries}},
school = {Linköping University},
type = {{LiTH-ISY-EX--17/5054--SE}},
year = {2017},
address = {Sweden},
}
Extracting foreground objects from an image is a hot research topic. Doing this for high-quality real-world images in real time on limited hardware, such as a smart phone, is a demanding task. This master thesis shows how this problem can be addressed using Otsu's method together with Gaussian probability distributions to create classifiers in different colour channels. We show how classifiers can be combined, resulting in higher accuracy than using only the individual classifiers, and propose using inter-class variance together with image variance to estimate classifier quality. A data set was produced to evaluate performance. The data set features real-world images captured by a smart phone and objects of varying complexity against plain backgrounds that can be found in a typical office or urban space.
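A minimal NumPy sketch of the thresholding idea follows: Otsu's threshold is computed per colour channel and the per-channel foreground masks are combined, here by a simple majority vote rather than the Gaussian probability models of the thesis; the input image is a random stand-in normalized to [0, 1].

    import numpy as np

    def otsu_threshold(channel, bins=256):
        # Classic Otsu: choose the threshold maximizing between-class variance.
        hist, edges = np.histogram(channel, bins=bins, range=(0.0, 1.0))
        p = hist / hist.sum()
        w0 = np.cumsum(p)                        # class-0 probability mass
        mu = np.cumsum(p * np.arange(bins))      # cumulative (unnormalized) mean
        with np.errstate(divide="ignore", invalid="ignore"):
            var_b = (mu[-1] * w0 - mu) ** 2 / (w0 * (1.0 - w0))
        return edges[np.nanargmax(var_b) + 1]

    def segment(img_rgb):
        # Threshold each channel independently, then majority-vote the masks.
        masks = [img_rgb[..., c] > otsu_threshold(img_rgb[..., c]) for c in range(3)]
        return np.sum(masks, axis=0) >= 2

    img = np.random.rand(64, 64, 3)  # stand-in for a smartphone photo
    print(segment(img).mean())       # fraction of pixels labeled foreground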
@mastersthesis{diva2:1144357,
author = {Poole, Alexander},
title = {{Real-Time Image Segmentation for Augmented Reality by Combining Multi-Channel Thresholds}},
school = {Linköping University},
type = {{LiTH-ISY-EX--17/5083--SE}},
year = {2017},
address = {Sweden},
}
During flights with manned or unmanned aircraft, continuous recording can result in a very high number of images to analyze and evaluate. To simplify image analysis and to minimize data link usage, appropriate images should be suggested for transfer and further analysis. This thesis investigates features used for selection of images worthy of further analysis using machine learning. The selection is done based on the criteria of having good quality, salient content and being unique compared to the other selected images. The investigation is approached by implementing two binary classifications, one regarding content and one regarding quality. The classifications are made using support vector machines. For each of the classifications three feature extraction methods are performed and the results are compared against each other. The feature extraction methods used are histograms of oriented gradients, features from the discrete cosine transform domain and features extracted from a pre-trained convolutional neural network. The images classified as both good and salient are then clustered based on similarity measures retrieved using color coherence vectors. One image from each cluster is retrieved and those are the resulting images from the image selection. The performance of the selection is evaluated using the measures precision, recall and accuracy. The investigation showed that using features extracted from the discrete cosine transform provided the best results for the quality classification. For the content classification, features extracted from a convolutional neural network provided the best results. The similarity retrieval proved to be the weakest part, and the entire system together provides an average accuracy of 83.99%.
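As a concrete example of one of the three feature types, the sketch below trains a support vector machine on histogram-of-oriented-gradients descriptors; the random images and labels are stand-ins, so the printed accuracy is not meaningful.

    import numpy as np
    from skimage.feature import hog
    from sklearn.svm import SVC

    rng = np.random.default_rng(0)
    images = rng.random((40, 64, 64))       # stand-in grayscale images
    labels = rng.integers(0, 2, 40)         # stand-in "salient content" labels

    # One HOG descriptor per image, then a binary SVM classification.
    X = np.array([hog(im, orientations=9, pixels_per_cell=(8, 8),
                      cells_per_block=(2, 2)) for im in images])
    clf = SVC(kernel="rbf").fit(X[:30], labels[:30])
    print("accuracy:", (clf.predict(X[30:]) == labels[30:]).mean())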
@mastersthesis{diva2:1151145,
author = {Lorentzon, Matilda},
title = {{Feature Extraction for Image Selection Using Machine Learning}},
school = {Linköping University},
type = {{LiTH-ISY-EX--17/5097--SE}},
year = {2017},
address = {Sweden},
}
The ability to automatically estimate the volume of timber is becoming increasingly important within the timber industry. The large number of timber trucks arriving each day at Swedish timber terminals reinforces the need for volume estimation performed in real time and on the go as the trucks arrive.
This thesis investigates if a volumetric integration of disparity maps acquired from a Multi-View Stereo (MVS) system is a suitable approach for automatic volume estimation of timber loads. As real-time execution is preferred, efforts were made to provide a scalable method. The proposed method was quantitatively evaluated on datasets containing two geometric objects of known volume. A qualitative comparison to manual volume estimates of timber loads was also made on datasets recorded at a Swedish timber terminal.
The proposed method is shown to be both accurate and precise under specific circumstances. However, robustness is poor to varying weather conditions, although a more thorough evaluation of this aspect needs to be performed. The method is also parallelizable, which means that future efforts can be made to significantly decrease execution time.
@mastersthesis{diva2:1153580,
author = {Rundgren, Emil},
title = {{Automatic Volume Estimation of Timber from Multi-View Stereo 3D Reconstruction}},
school = {Linköping University},
type = {{LiTH-ISY-EX--17/5093--SE}},
year = {2017},
address = {Sweden},
}
Being able to reconstruct real-world environments into digital 3D models has many different types of interesting applications. With the current state of the art, the results can be very impressive, but there is naturally still room for improvement. This thesis looks into essentially two different parts. The first part is about finding out whether it is feasible to detect geometric primitives, mainly planes, in the initially reconstructed point cloud. The second part looks into using the information about which points have been fitted to a geometric primitive to improve the final model.
Detection of the geometric primitives is done using the RANSAC algorithm, a method for discovering whether a given model is present in a data set (a minimal plane-fitting sketch is given after this abstract).
A few different alternatives are evaluated for using the information about the geometric primitives to improve the final surface. The first option is to project points onto their identified shape. The second option is to remove points that have not been matched to a shape. The last option is to evaluate the possibility of changing the weights of individual points, which is an alternative available in the chosen surface reconstruction method.
The detection of geometric primitives shows some potential, but it often requires manual intervention to find correct parameters for different types of data sets. As for using the information about the geometric primitives to improve the final model, neither projecting points onto their shapes nor removing non-matched points quite addresses the problem at hand. Increasing the weights of matched points does show some potential, but is still far from being a complete method.
A small part of the thesis looks into the possibility of automatically finding areas with significant differences between the initial point cloud and a reconstructed surface, using hierarchical clustering. This part is, however, not evaluated quantitatively.
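As referenced above, a minimal NumPy sketch of RANSAC plane detection follows; the tolerance and iteration count are illustrative, and the synthetic point cloud places 700 of 1000 points on the plane z = 0.5.

    import numpy as np

    def ransac_plane(points, n_iter=500, tol=0.02, seed=0):
        # Repeatedly fit a plane to 3 random points; keep the largest consensus set.
        rng = np.random.default_rng(seed)
        best = np.zeros(len(points), dtype=bool)
        for _ in range(n_iter):
            p0, p1, p2 = points[rng.choice(len(points), 3, replace=False)]
            n = np.cross(p1 - p0, p2 - p0)
            norm = np.linalg.norm(n)
            if norm < 1e-12:
                continue                       # degenerate (collinear) sample
            inliers = np.abs((points - p0) @ (n / norm)) < tol  # point-to-plane distance
            if inliers.sum() > best.sum():
                best = inliers
        return best

    pts = np.random.rand(1000, 3)
    pts[:700, 2] = 0.5                         # 700 points on the plane z = 0.5
    print(ransac_plane(pts).sum())             # roughly 700 inliers found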
@mastersthesis{diva2:1153573,
author = {Norlander, Robert},
title = {{Make it Complete:
Surface Reconstruction Aided by Geometric Primitives}},
school = {Linköping University},
type = {{LiTH-ISY-EX--17/5096--SE}},
year = {2017},
address = {Sweden},
}
The Exponential Linear Unit (ELU) has been proven to speed up learning and improve the classification performance over activation functions such as ReLU and Leaky ReLU for convolutional neural networks. The reasons behind the improved behavior are that ELU reduces the bias shift, it saturates for large negative inputs and it is continuously differentiable. However, it remains open whether ELU has the optimal shape and we address the quest for a superior activation function.
We use a new formulation to tune a piecewise linear activation function during training, to investigate the above question and learn the shape of the locally optimal activation function. With this tuned activation function, the classification performance is improved and the resulting, learned activation function turns out to be ELU-shaped irrespective of whether it is initialized as a ReLU, LReLU or ELU. Interestingly, the learned activation function does not exactly pass through the origin, indicating that a shifted ELU-shaped activation function is preferable. This observation leads us to introduce the Shifted Exponential Linear Unit (ShELU) as a new activation function.
Experiments on CIFAR-100 show that the classification performance is further improved when using the ShELU activation function in comparison with ELU. The improvement is achieved when learning an individual bias shift for each neuron.
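One plausible reading of the proposed ShELU, sketched below in PyTorch under that assumption, is a standard ELU preceded by a learnable per-channel (per-neuron) input shift; the module name and the exact placement of the shift are illustrative, not necessarily the authors' formulation.

    import torch
    import torch.nn as nn

    class ShELU(nn.Module):
        # ELU with an individually learned horizontal (input) shift per channel.
        def __init__(self, num_channels, alpha=1.0):
            super().__init__()
            self.shift = nn.Parameter(torch.zeros(1, num_channels, 1, 1))
            self.elu = nn.ELU(alpha=alpha)

        def forward(self, x):
            return self.elu(x + self.shift)

    x = torch.randn(4, 16, 8, 8)
    print(ShELU(16)(x).shape)  # same shape as the input, (4, 16, 8, 8)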
@techreport{diva2:1154026,
author = {Grelsson, Bertil and Felsberg, Michael},
title = {{Performance boost in Convolutional Neural Networks by tuning shifted activation functions}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2017},
type = {Other academic},
address = {Sweden},
}
The thesis work evaluates a method to estimate the volume of stone and gravel piles using only a cellphone to collect video and sensor data from the gyroscopes and accelerometers. The project is commissioned by Escenda Engineering with the motivation to replace more complex and resource-demanding systems with a cheaper and easy-to-use handheld device. The implementation features popular computer vision methods such as KLT tracking, Structure-from-Motion and Space Carving, together with some sensor fusion. The results imply that it is possible to estimate volumes up to a certain accuracy, which is limited by the sensor quality, and with a bias.
@mastersthesis{diva2:1172784,
author = {Fallqvist, Marcus},
title = {{Automatic Volume Estimation Using Structure-from-Motion Fused with a Cellphone's Inertial Sensors}},
school = {Linköping University},
type = {{LiTH-ISY-EX--17/5107--SE}},
year = {2017},
address = {Sweden},
}
Barcodes are ubiquitous in modern society and have had industrial applications for decades. However, modern methods can underperform on noisy images: poor lighting conditions, occlusions and low resolution can be problematic for decoding. This thesis aims to solve this problem using neural networks, which have enjoyed great success in many computer vision competitions in recent years. We investigate how three different networks perform on data sets with noisy images. The first network is a single classifier, the second is an ensemble classifier and the third is based on a pre-trained feature extractor. For comparison, we also test two baseline methods that are used in industry today. We generate training data using software and modify it to ensure proper generalization. Testing data is created by photographing barcodes in different settings, creating six image classes: normal, dark, white, rotated, occluded and wrinkled. The proposed single classifier and ensemble classifier outperform the baseline as well as the pre-trained feature extractor by a large margin. The thesis work was performed at SICK IVP, a machine vision company in Linköping, in 2017.
@mastersthesis{diva2:1164104,
author = {Fridborn, Fredrik},
title = {{Reading Barcodes with Neural Networks}},
school = {Linköping University},
type = {{LiTH-ISY-EX--17/5102--SE}},
year = {2017},
address = {Sweden},
}
Semantic segmentation of a scene aims to give meaning to the scene by dividing it into meaningful — semantic — parts. Understanding the scene is of great interest for all kinds of autonomous systems, but manual annotation is simply too time consuming, which is why there is a need for an alternative approach. This thesis investigates the possibility of automatically segmenting 3D-models of urban scenes, such as buildings, into a predetermined set of labels. The approach was to first acquire ground truth data by manually annotating five 3D-models of different urban scenes. The next step was to extract features from the 3D-models and evaluate which ones constitutes a suitable feature space. Finally, three supervised learners were implemented and evaluated: k-Nearest Neighbour (KNN), Support Vector Machine (SVM) and Random Classification Forest (RCF). The classifications were done point-wise, classifying each 3D-point in the dense point cloud belonging to the model being classified.
The results showed that the most suitable feature space is not necessarily the one containing all features. The KNN classifier achieved the highest average accuracy over all models, classifying 42.5% of the 3D points correctly. The RCF classifier managed to classify 66.7% of the points correctly in one of the models, but performed worse on the rest of the models, resulting in a lower average accuracy compared to KNN. In general, KNN, SVM and RCF seemed to have different benefits and drawbacks. KNN is simple and intuitive but by far the slowest classifier when dealing with a large set of training data. SVM and RCF are both fast but difficult to tune as there are more parameters to adjust. Whether the relatively low best accuracy was due to the lack of ground truth training data, unbalanced validation models, or the capacity of the learners was never investigated due to the limited time span. However, this ought to be investigated in future studies.
@mastersthesis{diva2:1166634,
author = {Lind, Johan},
title = {{Make it Meaningful:
Semantic Segmentation of Three-Dimensional Urban Scene Models}},
school = {Linköping University},
type = {{LiTH-ISY-EX--17/5103--SE}},
year = {2017},
address = {Sweden},
}
The purpose of this document is to reflect on novel and upcoming methods for computer vision that might have relevance for application in robot vision and video analytics. The document covers many different sub-fields of computer vision, most of which have been addressed by our research activity at the computer vision laboratory. The report has been written based on a request of, and supported by, FOI.
@techreport{diva2:1165440,
author = {Felsberg, Michael},
title = {{Five years after the Deep Learning revolution of computer vision:
State of the art methods for online image and video analysis}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2017},
type = {Other academic},
number = {, },
address = {Sweden},
}
The recent emergence of time-of-flight cameras has opened up new possibilities in the world of computer vision. These compact sensors, capable of recording the depth of a scene in real time, are very advantageous in many applications, such as scene or object reconstruction. This thesis first addresses the problem of fusing depth data with color images. A complete process for combining a time-of-flight camera with a color camera is described and its accuracy is evaluated. The results show that a satisfactory precision is reached and that the calibration step is very important.
The second part of the work consists of applying super-resolution techniques to the time-of-flight camera in order to improve its low resolution. Different types of super-resolution algorithms exist, but this thesis focuses on the combination of multiple shifted depth maps. The proposed framework consists of two steps: registration and reconstruction. Different methods for each step are tested and compared according to the improvements reached in terms of level of detail, sharpness, and noise reduction. The results obtained show that Lucas-Kanade performs best for the registration and that a non-uniform interpolation gives the best results in terms of reconstruction. Finally, a few suggestions are made about future work and extensions of our solutions.
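The reconstruction step described above can be sketched as follows: each low-resolution depth map is placed on a common grid according to its estimated sub-pixel shift, and a high-resolution map is obtained by non-uniform interpolation. The shifts are assumed known here (the thesis estimates them with Lucas-Kanade registration), and the data is synthetic.

    # Minimal sketch of multi-frame depth super-resolution by shift-and-interpolate.
    import numpy as np
    from scipy.interpolate import griddata

    def fuse_depth_maps(depth_maps, shifts, scale=2):
        h, w = depth_maps[0].shape
        pts, vals = [], []
        for d, (dy, dx) in zip(depth_maps, shifts):
            yy, xx = np.mgrid[0:h, 0:w]
            pts.append(np.column_stack([(yy + dy).ravel(), (xx + dx).ravel()]))
            vals.append(d.ravel())
        gy, gx = np.mgrid[0:h:1.0 / scale, 0:w:1.0 / scale]
        return griddata(np.vstack(pts), np.concatenate(vals), (gy, gx),
                        method="linear")   # non-uniform interpolation

    maps = [np.random.rand(32, 32) for _ in range(4)]            # shifted depth maps
    shifts = [(0.0, 0.0), (0.25, 0.0), (0.0, 0.25), (0.25, 0.25)]
    hr = fuse_depth_maps(maps, shifts)                           # (64, 64) result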
@mastersthesis{diva2:1149382,
author = {Zins, Matthieu},
title = {{Color Fusion and Super-Resolution for Time-of-Flight Cameras}},
school = {Linköping University},
type = {{LiTH-ISY-EX--17/5089--SE}},
year = {2017},
address = {Sweden},
}
The objective of this master's thesis work is to evaluate the potential benefit of a superpixel preprocessing step for general object detection in a traffic environment. The various effects of different superpixel parameters on object detection performance, as well as the benefit of including depth information when generating the superpixels, are investigated.
In this work, three superpixel algorithms are implemented and compared, including a proposal for an improved version of the popular Simple Linear Iterative Clustering superpixel algorithm (SLIC). The proposed improved algorithm utilises a coarse-to-fine approach which outperforms the original SLIC for high-resolution images. An object detection algorithm is also implemented and evaluated. The algorithm makes use of depth information obtained by a stereo camera to extract superpixels corresponding to foreground objects in the image. Hierarchical clustering is then applied, with the segments formed by the clustered superpixels indicating potential objects in the input image.
The object detection algorithm managed to detect on average 58% of the objects present in the chosen dataset. It performed especially well for detecting pedestrians or other objects close to the car. Altering the density distribution of the superpixels in the image yielded an increase in detection rate, and this could be achieved both with and without utilising depth information. It was also shown that the use of superpixels greatly reduces the amount of computation needed by the algorithm, indicating that a real-time implementation is feasible.
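As a sketch of the superpixel-plus-depth idea described above, the example below generates SLIC superpixels with scikit-image and keeps those whose median depth indicates a nearby foreground object. The threshold and segment count are illustrative assumptions, not the tuned parameters from the thesis.

    # Minimal sketch: depth-filtered SLIC superpixels as object candidates.
    import numpy as np
    from skimage.segmentation import slic

    def foreground_superpixels(rgb, depth, max_depth=20.0, n_segments=400):
        labels = slic(rgb, n_segments=n_segments, compactness=10)
        keep = {sp for sp in np.unique(labels)
                if np.median(depth[labels == sp]) < max_depth}
        return labels, keep

    rgb = np.random.rand(120, 160, 3)            # stand-in camera image
    depth = np.random.rand(120, 160) * 50.0      # stand-in stereo depth (m)
    labels, fg = foreground_superpixels(rgb, depth)

The kept superpixels would then be grouped by hierarchical clustering into object hypotheses, as the abstract describes.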
@mastersthesis{diva2:1141088,
author = {Wälivaara, Marcus},
title = {{General Object Detection Using Superpixel Preprocessing}},
school = {Linköping University},
type = {{LiTH-ISY-EX--17/5085--SE}},
year = {2017},
address = {Sweden},
}
Deep learning has dominated the computer vision field since 2012, but a common criticism of deep learning methods is their dependence on large amounts of data. To counter this criticism, research into data-efficient deep learning is growing. The foremost success in data-efficient deep learning is transfer learning with networks pre-trained on the ImageNet dataset. Pre-trained networks have achieved state-of-the-art performance on many tasks. We consider the pre-trained network method for a new task where we have to collect the data ourselves. We hypothesize that the data efficiency of pre-trained networks can be improved through informed data collection. After exhaustive experiments on CaffeNet and VGG16, we conclude that the data efficiency can indeed be improved. Furthermore, we investigate an alternative approach to data-efficient learning, namely adding domain knowledge in the form of a spatial transformer to the pre-trained networks. We find that spatial transformers are difficult to train and do not seem to improve data efficiency.
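A minimal sketch of the pre-trained-network recipe evaluated above is shown below: an ImageNet-trained backbone is frozen and only a new task head is trained on the collected data. VGG16 from torchvision is used as a stand-in (CaffeNet has no torchvision counterpart), and the five-class head is an assumption for the example.

    # Minimal sketch of transfer learning with a frozen pre-trained network.
    import torch.nn as nn
    from torchvision import models

    model = models.vgg16(weights="IMAGENET1K_V1")
    for p in model.parameters():
        p.requires_grad = False                  # keep the pre-trained features
    model.classifier[6] = nn.Linear(4096, 5)     # new head for a 5-class task
    trainable = [p for p in model.parameters() if p.requires_grad]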
@mastersthesis{diva2:1112122,
author = {Lundström, Dennis},
title = {{Data-efficient Transfer Learning with Pre-trained Networks}},
school = {Linköping University},
type = {{LiTH-ISY-EX--17/5051--SE}},
year = {2017},
address = {Sweden},
}
This work investigates the landscape of aerial image stereo matching (AISM) methods suitable for large-scale forest variable estimation. AISM methods are an important source of remotely collected information used in modern forestry to keep track of a growing forest's condition.
A total of 17 AISM methods are investigated, out of which four are evaluated by processing a test data set consisting of three aerial images. The test area is located in southern Sweden and consists mainly of Norway spruce and Scots pine. From the resulting point clouds and height raster images, a total of 30 different metrics of both height and density types are derived. Linear regression is used to fit functions from the metrics derived from AISM data to a set of forest variables including tree height (HBW), tree diameter (DBW), basal area, and volume. As ground truth, data collected by dense airborne laser scanning is used. Results are presented as RMSE and standard deviation obtained from the linear regression.
For tree height, tree diameter, basal area, and volume, the RMSE ranged from 7.442% to 10.11%, 11.58% to 13.96%, 32.01% to 35.10%, and 34.01% to 38.26%, respectively. The results show that all four tested methods achieved comparable estimation quality, although with small differences among them: Keystone and SURE performed somewhat better, while MicMac placed third and Photoscan produced the least accurate result.
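The estimation step above amounts to ordinary linear regression from point cloud metrics to each forest variable, with the error reported relative to the variable's mean. The sketch below illustrates this on synthetic data; the 30 metrics and the laser-scanned ground truth from the thesis are not reproduced.

    # Minimal sketch: linear regression from AISM metrics to a forest variable.
    import numpy as np
    from sklearn.linear_model import LinearRegression

    rng = np.random.default_rng(1)
    metrics = rng.normal(size=(200, 30))                       # height/density metrics
    height = 20 + 2 * metrics[:, 0] + rng.normal(0, 1, 200)    # synthetic tree height

    reg = LinearRegression().fit(metrics, height)
    pred = reg.predict(metrics)
    rmse_pct = 100 * np.sqrt(np.mean((pred - height) ** 2)) / height.mean()
    print(f"relative RMSE: {rmse_pct:.2f}%")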
@mastersthesis{diva2:1109735,
author = {Svensk, Joakim},
title = {{Evaluation of Aerial Image Stereo Matching Methods for Forest Variable Estimation}},
school = {Linköping University},
type = {{LiTH-ISY-EX--17/5036--SE}},
year = {2017},
address = {Sweden},
}
Now and then, train accidents occur. Collisions between trains and objects such as animals, humans, cars, and fallen trees can result in casualties, severe damage to the train, and delays in train traffic. Thus, train collisions are a considerable problem with consequences affecting society substantially.
The company Termisk Systemteknik AB has, on commission from Rindi Solutions AB, investigated the possibility of detecting anomalies on the railway using a train-mounted thermal imaging camera. Rails are also detected in order to determine whether an anomaly is on the rail or not. However, the rail detection method does not work satisfactorily at long range.
The purpose of this master's thesis is to improve the previous rail detector at long range by using machine learning, in particular deep learning and a convolutional neural network. It is also of interest to investigate whether there are any advantages to using cross-modal transfer learning.
A labelled dataset for training and testing was produced manually. Also, a loss function tailored to the particular problem at hand was constructed. The loss function was used both for improving the system during training and for evaluating the system's performance during testing. Finally, eight different approaches were evaluated, each one resulting in a different rail detector.
Several of the rail detectors, and in particular all the rail detectors using cross-modal transfer learning, perform better than the previous rail detector. Thus, the new rail detectors show great potential for the rail detection problem.
@mastersthesis{diva2:1111486,
author = {Wedberg, Magnus},
title = {{Detecting Rails in Images from a Train-Mounted Thermal Camera Using a Convolutional Neural Network}},
school = {Linköping University},
type = {{LiTH-ISY-EX--17/5058--SE}},
year = {2017},
address = {Sweden},
}
When forensic examiners try to identify the perpetrator of a felony, they use individual facial marks when comparing the suspect with the perpetrator. Facial marks are often used for identification, and today they are found manually. To speed up this process, it is desirable to detect interesting facial marks automatically. This master thesis describes a method to automatically detect and separate permanent and non-permanent marks. It uses a fast radial symmetry algorithm as a core element of the mark detector. After candidate skin mark extraction, false detections are removed based on their size, shape, and number of hair pixels. The classification of the skin marks is done with a support vector machine, and the different features are examined. The results show that the facial mark detector has good recall while its precision is poor. The methods for eliminating false detections were analysed, as well as the different features for the classifier. One can conclude that the color of facial marks is more relevant than their structure when classifying them into permanent and non-permanent marks.
@mastersthesis{diva2:1107743,
author = {Moulis, Armand},
title = {{Automatic Detection and Classification of Permanent and Non-Permanent Skin Marks}},
school = {Linköping University},
type = {{LiTH-ISY-EX--17/5048--SE }},
year = {2017},
address = {Sweden},
}
In computer vision, it has in recent years become more popular to use point clouds to represent 3D data. To understand what a point cloud contains, methods like semantic segmentation can be used. Semantic segmentation is the problem of segmenting images or point clouds and understanding what the different segments are. One application for semantic segmentation of point clouds is autonomous driving, where the car needs information about the objects in its surroundings.
Our approach to the problem is to project the point clouds into 2D virtual images using the Katz projection. We then use pre-trained convolutional neural networks to semantically segment the images. To obtain the semantically segmented point clouds, we project the scores from the segmentation back into the point cloud. Our approach is evaluated on the Semantic3D dataset. We find that our method is comparable to the state of the art, without any fine-tuning on the Semantic3D dataset.
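The back-projection step described above can be sketched with a pinhole camera model: each 3D point is projected into the virtual image, and the per-pixel class scores from the 2D segmentation are copied onto the points that land inside the image. The intrinsics and score volume below are synthetic stand-ins, and the Katz visibility test is omitted for brevity.

    # Minimal sketch: transferring 2D segmentation scores back to a point cloud.
    import numpy as np

    def backproject_scores(points, scores, K):
        """points: (N, 3) camera coords, scores: (H, W, C), K: 3x3 intrinsics."""
        h, w, _ = scores.shape
        uvw = (K @ points.T).T
        u = (uvw[:, 0] / uvw[:, 2]).round().astype(int)
        v = (uvw[:, 1] / uvw[:, 2]).round().astype(int)
        ok = (u >= 0) & (u < w) & (v >= 0) & (v < h) & (points[:, 2] > 0)
        labels = np.full(len(points), -1)          # -1: outside the image
        labels[ok] = scores[v[ok], u[ok]].argmax(axis=1)
        return labels

    K = np.array([[100.0, 0, 80], [0, 100.0, 60], [0, 0, 1]])
    pts = np.random.rand(500, 3) * [2, 2, 5] + [-1, -1, 1]
    scores = np.random.rand(120, 160, 8)           # 8 hypothetical classes
    labels = backproject_scores(pts, scores, K)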
@mastersthesis{diva2:1091059,
author = {Tosteberg, Patrik},
title = {{Semantic Segmentation of Point Clouds Using Deep Learning}},
school = {Linköping University},
type = {{LiTH-ISY-EX--17/5029--SE}},
year = {2017},
address = {Sweden},
}
@techreport{diva2:1083263,
author = {Eldesokey, Abdelrahman},
title = {{Normalized Convolutional Neural Networks for Sparse Data}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2017},
type = {Other academic},
number = {LiTH-ISY-R, 3096},
address = {Sweden},
}
In the field of Natural Language Processing, supervised machine learning is commonly used to solve classification tasks such as sentiment analysis and text categorization. The classical way of representing the text has been to use the well-known Bag-of-Words representation. However, lately, low-dimensional dense word vectors have come to dominate the input to state-of-the-art models. Since few studies have made a fair comparison of the models' sensitivity to the text representation, this thesis tries to fill that gap. We especially seek insight into the impact various unsupervised pre-trained vectors have on the performance. In addition, we take a closer look at the Random Indexing representation and try to optimize it jointly with the classification task. The results show that while low-dimensional pre-trained representations often have computational benefits and have also reported state-of-the-art performance, they do not necessarily outperform the classical representations in all cases.
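The two representations compared above can be sketched side by side: a sparse Bag-of-Words matrix versus documents encoded as the mean of their word vectors, both feeding the same linear classifier. The toy corpus and random vectors below stand in for real data and pre-trained embeddings.

    # Minimal sketch: Bag-of-Words versus averaged word vectors.
    import numpy as np
    from sklearn.feature_extraction.text import CountVectorizer
    from sklearn.linear_model import LogisticRegression

    docs = ["good movie", "bad movie", "great film", "awful film"]
    y = [1, 0, 1, 0]

    bow = CountVectorizer().fit_transform(docs)        # sparse BoW features
    clf_bow = LogisticRegression().fit(bow, y)

    rng, dim, vocab = np.random.default_rng(0), 50, {}
    def doc_vec(doc):                                  # mean of word vectors
        return np.mean([vocab.setdefault(w, rng.normal(size=dim))
                        for w in doc.split()], axis=0)

    dense = np.stack([doc_vec(d) for d in docs])       # dense low-dim features
    clf_vec = LogisticRegression().fit(dense, y)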
@mastersthesis{diva2:928411,
author = {Norlund, Tobias},
title = {{The Use of Distributional Semantics in Text Classification Models:
Comparative performance analysis of popular word embeddings}},
school = {Linköping University},
type = {{LiTH-ISY-EX--16/4926--SE}},
year = {2016},
address = {Sweden},
}
Measurements performed from stereo reconstruction can be obtained with high accuracy using correctly calibrated cameras. A stereo camera rig mounted in an outdoor environment is exposed to temperature changes, which have an impact on the calibration of the cameras.
The aim of the master thesis was to investigate the thermal impact on a calibrated stereo camera rig. This was done by placing a stereo rig in a temperature chamber and collecting data of a calibration board at different temperatures. Data was collected with two different cameras and lenses and used for calibration of the stereo camera rig in different scenarios. The obtained parameters were plotted and analyzed.
The results of the master thesis show that thermal variation has an impact on the accuracy of the calibrated stereo camera rig. A calibration obtained at one temperature cannot be used at a different temperature without a degradation of the accuracy. The plotted parameters from the calibration had a high noise level due to problems with the calibration methods, and no visible trend from temperature changes could be seen.
@mastersthesis{diva2:941863,
author = {Andersson, Elin},
title = {{Thermal Impact of a Calibrated Stereo Camera Rig}},
school = {Linköping University},
type = {{LiTH-ISY-EX--16/4980--SE}},
year = {2016},
address = {Sweden},
}
Segmentation of the brain into sub-volumes has many clinical applications. Many neurological diseases are connected with brain atrophy (tissue loss). By dividing the brain into smaller compartments, volume comparisons between the compartments can be made, as well as monitoring of local volume changes over time. The former is especially interesting for the left and right cerebral hemispheres, due to their symmetric appearance. By using automatic segmentation, the time-consuming step of manually labelling the brain is removed, allowing for larger-scale research.
In this thesis, three automatic methods for segmenting the brain from magnetic resonance (MR) images are implemented and evaluated. Since none of the evaluated methods resulted in sufficiently good segmentations to be clinically relevant, a novel segmentation method, called SB-GC (shape bottleneck detection incorporated in graph cuts), is also presented. SB-GC uses quantitative MRI data as input, together with shape bottleneck detection and graph cuts, to segment the brain into the left and right cerebral hemispheres, the cerebellum, and the brain stem. SB-GC shows promise of highly accurate and repeatable results for both healthy adult brains and more challenging cases such as children and brains containing pathologies.
@mastersthesis{diva2:933699,
author = {Stacke, Karin},
title = {{Automatic Brain Segmentation into Substructures Using Quantitative MRI}},
school = {Linköping University},
type = {{LiTH-ISY-EX--16/4956--SE}},
year = {2016},
address = {Sweden},
}
Face recognition is the problem of identifying individuals in images. This thesis evaluates two methods used to determine if pairs of face images belong to the same individual or not. The first method is a combination of principal component analysis and a neural network, and the second method is based on state-of-the-art convolutional neural networks. They are trained and evaluated using two different data sets. The first set contains many images with large variations in, for example, illumination and facial expression. The second consists of fewer images with small variations.
Principal component analysis allowed the use of smaller networks. The largest network has 1.7 million parameters, compared to the 7 million used in the convolutional network. The use of smaller networks lowered the training time and evaluation time significantly. Principal component analysis proved to be well suited for the data set with small variations, outperforming the convolutional network, which needs larger data sets to avoid overfitting. The reduction in data dimensionality, however, led to difficulties classifying the data set with large variations. The generous amount of images in this set allowed the convolutional method to reach higher accuracies than the principal component method.
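A minimal sketch of the first method's pipeline is given below: PCA compresses the face images before a small network decides whether a pair depicts the same person. The dimensions, pair construction, and random data are assumptions for the example only.

    # Minimal sketch: PCA compression followed by a small pair classifier.
    import numpy as np
    from sklearn.decomposition import PCA
    from sklearn.neural_network import MLPClassifier

    rng = np.random.default_rng(0)
    pairs = rng.normal(size=(300, 2, 1024))     # 300 pairs of 32x32 face images
    same = rng.integers(0, 2, size=300)         # 1 if the pair is one person

    pca = PCA(n_components=64).fit(pairs.reshape(-1, 1024))
    feat = pca.transform(pairs.reshape(-1, 1024)).reshape(300, -1)
    clf = MLPClassifier(hidden_layer_sizes=(32,), max_iter=300).fit(feat, same)

The point of the PCA step is the one made above: a 128-dimensional pair representation supports a far smaller network than raw pixels would.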
@mastersthesis{diva2:931705,
author = {Habrman, David},
title = {{Face Recognition with Preprocessing and Neural Networks}},
school = {Linköping University},
type = {{LiTH-ISY-EX--16/4953--SE}},
year = {2016},
address = {Sweden},
}
The art of reconstructing a real-world scene digitally has been on the minds of researchers for decades. Recently, it has attracted more and more attention from companies seeing a chance to bring this kind of technology to the market. Digital reconstruction of buildings in particular is a niche that has both potential and room for improvement. With this background, this thesis presents the design and evaluation of a pipeline made to find and correct approximately flat surfaces in architectural scenes. The scenes are 3D-reconstructed triangle meshes based on RGB images. The thesis also comprises an evaluation of a few different components available for doing this, leading to a choice of the best components. The goal is to improve the visual quality of the reconstruction.
The final pipeline is designed with two blocks: one to detect initial plane seeds and one to refine the detected planes. The first block makes use of a multi-label energy formulation on the graph that describes the reconstructed surface. Penalties are assigned to each vertex and each edge of the graph based on the vertex labels, effectively describing a Markov Random Field. The energy is minimized with the help of the alpha-expansion algorithm. The second block uses heuristics for growing the detected plane seeds, merging similar planes together, and extracting deviating details.
Results on several scenes are presented, showing that the visual quality has been improved while maintaining accuracy compared with ground truth data.
@mastersthesis{diva2:917230,
author = {Jonsson, Mikael},
title = {{Make it Flat:
Detection and Correction of Planar Regions in Triangle Meshes}},
school = {Linköping University},
type = {{LiTH-ISY-EX--16/4930--SE}},
year = {2016},
address = {Sweden},
}
Lung cancer is the most common type of cancer in the world and always manifests as lung nodules. Nodules are small tumors that consist of lung tissue. They are usually spherical in shape and their cores can be either solid or subsolid. Nodules are common in lungs, but not all of them are malignant. To determine if a nodule is malignant or benign, attributes like nodule size and volume growth are commonly used. The procedure to obtain these attributes is time consuming, and therefore calls for tools to simplify the process.
The purpose of this thesis work was to investigate the feasibility of a semi-automatic lung nodule segmentation pipeline including volume estimation. This was done by implementing, tuning and evaluating image processing algorithms with different characteristics to create pipeline candidates. These candidates were compared using a similarity index between their segmentation results and ground truth markings to determine the most promising one.
The best performing pipeline consisted of a fixed region of interest together with a level set segmentation algorithm. Its segmentation accuracy was not consistent for all nodules evaluated, but the pipeline showed great potential when dynamically adapting its parameters for each nodule. The use of dynamic parameters was only briefly explored, and further research would be necessary to determine its feasibility.
@mastersthesis{diva2:911649,
author = {Berglin, Lukas},
title = {{Design, Evaluation and Implementation of a Pipeline for Semi-Automatic Lung Nodule Segmentation}},
school = {Linköping University},
type = {{LiTH-ISY-EX--16/4925--SE}},
year = {2016},
address = {Sweden},
}
Simultaneous localization and mapping (SLAM) is the problem of mapping your surroundings while simultaneously localizing yourself in the map. It is an important and active area of research for robotics. In this master thesis, two approaches are explored to reduce the drift that appears over time in SLAM algorithms. The first approach tries three different motion models for the camera. Two of the models exploit the a priori knowledge that the camera is mounted on a trolley, and these two are shown to improve the results. The second approach attempts to reduce the drift by reducing noise in the point cloud data used for mapping. This is done by finding planar surfaces in the point clouds; median filtering is used as an alternative against which the noise reduction is compared. The plane estimation approach is also shown to reduce the drift, while median filtering makes it worse.
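The plane-based noise reduction described above can be sketched with a RANSAC plane fit: inlier points are snapped onto the estimated plane before being used for mapping. Thresholds and data below are illustrative, and the thesis's actual plane detection may differ.

    # Minimal sketch: RANSAC plane fit and projection of inliers onto the plane.
    import numpy as np

    def ransac_plane(pts, iters=200, tol=0.02, seed=0):
        rng = np.random.default_rng(seed)
        best, best_inl, normal, origin = 0, None, None, None
        for _ in range(iters):
            p = pts[rng.choice(len(pts), 3, replace=False)]
            n = np.cross(p[1] - p[0], p[2] - p[0])
            if np.linalg.norm(n) < 1e-9:
                continue                          # degenerate sample
            n = n / np.linalg.norm(n)
            inl = np.abs((pts - p[0]) @ n) < tol  # point-to-plane distance test
            if inl.sum() > best:
                best, best_inl, normal, origin = inl.sum(), inl, n, p[0]
        proj = pts.copy()                         # snap inliers onto the plane
        dist = (pts[best_inl] - origin) @ normal
        proj[best_inl] = pts[best_inl] - np.outer(dist, normal)
        return proj, best_inl

    cloud = np.random.rand(2000, 3)
    cloud[:1500, 2] = 0.5 + 0.01 * np.random.randn(1500)   # noisy planar floor
    denoised, inliers = ransac_plane(cloud)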
@mastersthesis{diva2:957728,
author = {Bondemark, Richard},
title = {{Improving SLAM on a TOF Camera by Exploiting Planar Surfaces}},
school = {Linköping University},
type = {{LiTH-ISY-EX--16/4984--SE}},
year = {2016},
address = {Sweden},
}
Detection and positioning of anatomical landmarks, also called points of interest (POI), is often a concept of interest in medical image processing. Different measures or automatic image analyses are often directly based on the positions of such points, e.g. in organ segmentation or tissue quantification. Manual positioning of these landmarks is a time-consuming and resource-demanding process. In this thesis, a general method for positioning of anatomical landmarks is outlined, implemented, and evaluated. The evaluation of the method is limited to three different POI: the left femur head, the right femur head, and vertebra T9. These POI are used to define the range of the abdomen in order to measure the amount of abdominal fat in 3D data acquired with quantitative magnetic resonance imaging (MRI). With more detailed information about the abdominal body fat composition, medical diagnoses can be issued with higher confidence. Examples of applications include identifying patients with a high risk of developing metabolic or catabolic disease and characterizing the effects of different interventions, e.g. training, bariatric surgery, and medication. The proposed method is shown to be highly robust and accurate for positioning of the left and right femur heads. Due to insufficient performance regarding T9 detection, a modified method is proposed for T9 positioning. The modified method shows promise of accurate and repeatable results but has to be evaluated more extensively before further conclusions can be drawn.
@mastersthesis{diva2:957048,
author = {Järrendahl, Hannes},
title = {{Automatic Detection of Anatomical Landmarks in Three-Dimensional MRI}},
school = {Linköping University},
type = {{LiTH-ISY-EX--16/4990--SE}},
year = {2016},
address = {Sweden},
}
Object Recognition is the art of localizing predefined objects in image sensor data. In this thesis a depth sensor was used which has the benefit that the 3D pose of the object can be estimated. This has applications in e.g. automatic manufacturing, where a robot picks up parts or tools with a robot arm.
This master thesis presents an implementation and an evaluation of a system for object recognition of 3D models in depth sensor data. The system uses several depth images rendered from a 3D model and describes their characteristics using so-called feature descriptors. These are then matched with the descriptors of a scene depth image to find the 3D pose of the model in the scene. The pose estimate is then refined iteratively using a registration method. Different descriptors and registration methods are investigated.
One of the main contributions of this thesis is that it compares two different types of descriptors, local and global, a comparison which has received little attention in research. This is done for two different scene scenarios, and for different types of objects and depth sensors. The evaluation shows that global descriptors are fast and robust for objects with a smooth visible surface, whereas the local descriptors perform better for larger objects in clutter and occlusion. This thesis also presents a novel global descriptor, the CESF, which is observed to be more robust than other global descriptors. As for the registration methods, ICP is shown to perform most accurately, while ICP point-to-plane is more robust.
@mastersthesis{diva2:972438,
author = {Grankvist, Ola},
title = {{Recognition and Registration of 3D Models in Depth Sensor Data}},
school = {Linköping University},
type = {{LiTH-ISY-EX--16/4993--SE}},
year = {2016},
address = {Sweden},
}
Cardiovascular diseases are among the most common causes of death worldwide. A recently developed flow analysis technique called 4D flow magnetic resonance imaging (MRI) allows early detection of such diseases. Due to the limited resolution and the limited contrast between blood pool and myocardium in 4D flow images, cine MR images are often used for cardiac segmentation. The delineated structures are then transferred to the 4D flow images for cardiovascular flow analysis. Cine MR images are, however, acquired with multiple breath-holds, which can be challenging for some people, especially when a cardiovascular disease is present. Consequently, unexpected breathing motion by a patient may lead to misalignments between the acquired cine MR images.
The goal of the thesis is to test the feasibility of an automatic image registration method for correcting the misalignment caused by respiratory motion in morphological 2D cine MR images, using the 4D flow MR data as the reference. As a registration method relies on a set of optimal parameters to provide the desired results, a comprehensive investigation was performed to find such parameters. Different combinations of registration parameter settings were applied to 20 datasets from both healthy volunteers and patients. The best combinations, selected on the basis of normalized cross-correlation, were evaluated against the clinical gold standard using widely used geometric measures of spatial correspondence. The accuracy of the best parameters from the geometric evaluation was finally validated using simulated misalignments.
Using a registration method consisting of only translation improved the results both for the datasets from healthy volunteers and patients and for the simulated misalignment data. For the datasets from healthy volunteers and patients, the registration improved the results from 0.7074 ± 0.1644 to 0.7551 ± 0.0737 in Dice index and from 1.8818 ± 0.9269 to 1.5953 ± 0.5192 in point-to-curve error. These values are mean values over all 20 datasets.
The results from geometric evaluation on the data from both healthy volunteers and patients show that the developed correction method is able to improve the alignment of the cine MR images. This allows a reliable segmentation of 4D flow MR images for cardiac flow assessment.
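For reference, the Dice index used in the geometric evaluation above measures the overlap of two binary masks; the sketch below computes it for a synthetic segmentation pair.

    # Minimal sketch: Dice index between two binary masks.
    import numpy as np

    def dice(a, b):
        a, b = a.astype(bool), b.astype(bool)
        return 2 * np.logical_and(a, b).sum() / (a.sum() + b.sum())

    ref = np.zeros((64, 64)); ref[16:48, 16:48] = 1   # reference mask
    seg = np.zeros((64, 64)); seg[20:52, 18:50] = 1   # shifted segmentation
    print(f"Dice = {dice(ref, seg):.4f}")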
@mastersthesis{diva2:972664,
author = {Härd, Victoria},
title = {{Automatic Alignment of 2D Cine Morphological Images Using 4D Flow MRI Data}},
school = {Linköping University},
type = {{LiTH-ISY-EX--16/4992--SE}},
year = {2016},
address = {Sweden},
}
This master's thesis presents an approach to track and count the number of fruit in commercial mango orchards. The algorithm is intended to enable precision agriculture and to facilitate labour and post-harvest storage planning. The primary objective is to develop a multi-view algorithm and to investigate how it can be used to mitigate the effects of visual occlusion, improving upon estimates from methods that use a single central viewpoint or two opposite viewpoints. Fruit are detected in images using two classification methods: dense pixel-wise CNN and region-based R-CNN detection. Pair-wise fruit correspondences are established between images using geometry provided by navigation data, and lidar data is used to generate image masks for each separate tree, isolating fruit counts to individual trees. The tracked fruit are triangulated to locate them in 3D space, and spatial statistics are calculated over whole orchard blocks. The estimated tree counts are compared to single-view estimates and validated against ground truth data for 16 mango trees from a Bundaberg mango orchard in Queensland, Australia. The results show a high R²-value of 0.99335 for four hand-labelled trees and a highest R²-value of 0.9165 for the machine-labelled images using the R-CNN classifier for the 16 target trees.
@mastersthesis{diva2:1045302,
author = {Stein, Madeleine},
title = {{Improving Image Based Fruitcount Estimates Using Multiple View-Points}},
school = {Linköping University},
type = {{LiTH-ISY-EX--16/5003--SE}},
year = {2016},
address = {Sweden},
}
Since most people now have a high-performing computing device with an attached camera in their pocket, in the form of a smartphone, robotics and computer vision researchers are thrilled about the possibility this creates. Such devices have previously been used in robotics to create 3D maps of environments and objects by feeding the camera data to a 3D reconstruction algorithm.
The big downside with smartphones is that their cameras use a different sensor than what is usually used in robotics, namely a rolling shutter camera. These cameras are cheaper to produce but are not as well suited for general 3D reconstruction algorithms as the global shutter cameras typically used in robotics research. One recent, accurate and computationally efficient 3D reconstruction method that could, if tweaked, be used on a mobile device is LSD-SLAM.
This thesis takes the LSD-SLAM method developed for global shutter cameras and incorporates additional methods that allow the use of rolling shutter data. The developed method is evaluated by counting the number of failed 3D reconstructions before a successful one is obtained when using rolling shutter data. The result is a method that improves this metric by about 70% compared to the unmodified LSD-SLAM method.
@mastersthesis{diva2:1058367,
author = {Tallund, Lukas},
title = {{Handling of Rolling Shutter Effects in Monocular Semi-Dense SLAM Algorithms}},
school = {Linköping University},
type = {{LiTH-ISY-EX--16/5016--SE}},
year = {2016},
address = {Sweden},
}
This thesis presents a way to generate a Digital Terrain Model (DTM) from a Digital Surface Model (DSM) and multispectral images (including the near-infrared (NIR) color band). An Artificial Neural Network (ANN) is used to pre-classify the DSM and the multispectral images, and the classification is in turn used to filter the DSM into a DTM. Using an ANN as the classifier provided good results, and the addition of the NIR color band improved the accuracy of the classifier. Using the classifier, a DTM was easily extracted without removing natural edges or height variations in the forests and cities. These challenges are handled well compared to earlier methods.
@mastersthesis{diva2:1058430,
author = {Tapper, Gustav},
title = {{Extraction of DTM from Satellite Images Using Neural Networks}},
school = {Linköping University},
type = {{LiTH-ISY-EX--16/5017--SE}},
year = {2016},
address = {Sweden},
}
The usage of 3D modelling is increasing fast, in both civilian and military areas, such as navigation, targeting, and urban planning. When creating a 3D model from satellite images, clouds can be problematic. Thus, automatic detection of clouds in the images is of great use. This master thesis was carried out at Vricon, which produces 3D models of the earth from satellite images. The thesis aimed to investigate whether Support Vector Machines could classify pixels into cloud or non-cloud, with a combination of texture and color as features. To solve the stated goal, the task was divided into several subproblems, where the first part was to extract features from the images. The images were then preprocessed before being fed to the classifier. After that, the classifier was trained and finally evaluated. The two methods that gave the best results in this thesis had approximately 95% correctly classified pixels. This result is better than the existing cloud segmentation method at Vricon, for the tested terrain and cloud types.
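A rough sketch of the pixel classification idea above is given below: each pixel is described by its color values plus a local-variance texture measure, and an SVM labels it cloud or non-cloud. The features and labels are stand-ins, not the thesis's exact descriptors.

    # Minimal sketch: color + texture features per pixel, classified by an SVM.
    import numpy as np
    from scipy.ndimage import uniform_filter
    from sklearn.svm import SVC

    def pixel_features(rgb):
        gray = rgb.mean(axis=2)
        tex = uniform_filter(gray ** 2, 5) - uniform_filter(gray, 5) ** 2
        return np.dstack([rgb, tex]).reshape(-1, 4)    # 3 color + 1 texture

    img = np.random.rand(60, 80, 3)                    # stand-in satellite tile
    X = pixel_features(img)
    y = (np.random.rand(60 * 80) > 0.5).astype(int)    # stand-in cloud labels
    svm = SVC(kernel="rbf").fit(X[:2000], y[:2000])    # train on a pixel subset
    cloud_mask = svm.predict(X).reshape(60, 80)        # per-pixel cloud mask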
@mastersthesis{diva2:932606,
author = {Gasslander, Maja},
title = {{Segmentation of Clouds in Satellite Images}},
school = {Linköping University},
type = {{LiTH-ISY-EX--16/4945--SE}},
year = {2016},
address = {Sweden},
}
Generic visual tracking is a challenging computer vision problem, where the position of a specified target is estimated through a sequence of frames. The only given information is the initial location of the target. Therefore, the tracker has to adapt to and learn any kind of object, which it describes through visual features used to differentiate target from background. Standard appearance features only capture momentary visual information. This master's thesis investigates the use of deep features extracted from optical flow images processed in a deep convolutional network. The optical flow is calculated using two consecutive images, and thereby captures the dynamic nature of the scene. Results show that this information is complementary to the standard appearance features and improves the performance of the tracker. Deep features are typically very high-dimensional, and employing dimensionality reduction can increase both the efficiency and the performance of the tracker. As a second aim of this thesis, PCA and PLS were evaluated and compared. The evaluations show that the two methods are almost equal in performance, with PLS receiving a slightly better score than the popular PCA. The final proposed tracker was evaluated on three challenging datasets and was shown to outperform other state-of-the-art trackers.
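The dimensionality-reduction comparison above can be sketched as follows: PCA compresses the deep features without supervision, while PLS uses the desired filter response as a target. The feature and target arrays are synthetic stand-ins for the deep motion features in the thesis.

    # Minimal sketch: PCA versus PLS on high-dimensional features.
    import numpy as np
    from sklearn.cross_decomposition import PLSRegression
    from sklearn.decomposition import PCA

    rng = np.random.default_rng(0)
    feats = rng.normal(size=(500, 512))        # stand-in deep features
    target = rng.normal(size=(500, 1))         # stand-in desired response

    pca_feats = PCA(n_components=32).fit_transform(feats)        # unsupervised
    pls = PLSRegression(n_components=32).fit(feats, target)
    pls_feats = pls.transform(feats)                             # supervised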
@mastersthesis{diva2:1071737,
author = {Gladh, Susanna},
title = {{Visual Tracking Using Deep Motion Features}},
school = {Linköping University},
type = {{LiTH-ISY-EX--16/5005--SE}},
year = {2016},
address = {Sweden},
}
Automatic stack measurement (automatisk travmätning) is a measurement system that measures the volume of wood on timber trucks. The system consists of six sensor systems. Each sensor is first calibrated individually and then jointly, to yield a common world coordinate system. Each sensor generates a depth image and a reflectance image, where the values in the depth image represent the distance from the camera. The client has developed an algorithm that, from the measurement data (the images), estimates the wood volume with an accuracy that fulfils the requirements set by the forest industry for automatic measurement of stacks on timber trucks. This report investigates whether better measurement results can be achieved, for example with other methods or combinations of them.
Around 125 datasets of stacks with ground truth were available. The ground truth consists of manual sample measurements where each individual log was measured separately. Initially, an active choice was made not to study the client's algorithm, so as not to be biased by how they arrived at their results. Primarily, the front and back images of a stack are used to find the logs. The found logs are then interpolated towards the middle of the stack, or the logs from the two sides are paired up. Sometimes there are problems with the images: most often, at least one of the sides is occluded by the truck cab, the crane, or another stack. In such cases, an estimate must be formed from the visible data in order to fill in the occluded regions.
At the beginning of the thesis work, two methods (MSER and a point-plane method) were used to investigate whether good results could be achieved by simply measuring the data and using it as an initial guess of the volume. However, it was discovered that valuable details in the datasets, needed to determine the wood volume more accurately, were missed; an example of such data is the distribution of diameters of the found log ends. Likewise, the volume tended to be heavily overestimated when the stacks contained a certain amount of branches or poorly delimbed logs. A geometric method was therefore constructed, and it was on this method that most time was spent. A table and a graph present the results of all three methods under bark (UB), together with the interval limits for fulfilling the requirements set by the forest industry.
@mastersthesis{diva2:968712,
author = {Lindberg, Pontus},
title = {{Automatisk volymmätning av virkestravar på lastbil}},
school = {Linköping University},
type = {{LiTH-ISY-EX--16/4955--SE}},
year = {2016},
address = {Sweden},
}
The poaching of rhinoceros has increased dramatically in the last few years, and the park rangers are often helpless against the militarised poachers. Linköping University is running several projects with the goal of aiding the park rangers in their work.
This master thesis was produced at CybAero AB, which builds Remotely Piloted Aircraft Systems (RPAS). With their helicopters, high-end cameras with a range sufficient to cover the whole area can be flown over the parks.
The aim of this thesis is to investigate different methods to automatically find rhinos and humans using airborne cameras. The system uses two cameras: one colour camera and one thermal camera. The latter is used to find interesting objects, which are then extracted in the colour image. The object is then classified as either rhino, human, or other. Several methods for classification have been evaluated.
The results show that classifying solely on the thermal image gives nearly as high accuracy as classifying in combination with the colour image. This enables the system to be used at dusk and dawn or in bad light conditions, which is an important factor since most poaching occurs at dusk or dawn. As a conclusion, a system capable of running on low-performance hardware and placeable on board the aircraft is presented.
@mastersthesis{diva2:843745,
author = {Karlsson Schmidt, Carl},
title = {{Rhino and Human Detection in Overlapping RGB and LWIR Images}},
school = {Linköping University},
type = {{LiTH-ISY-EX--15/4837--SE}},
year = {2015},
address = {Sweden},
}
The Kinect v2 is an RGB-D sensor manufactured as a gesture interaction tool for the entertainment console Xbox One. In this thesis we use it to perform 3D reconstruction and investigate its ability to measure depth. In order to sense both color and depth, the Kinect v2 has two cameras: one RGB camera and one infrared camera, the latter used to produce depth and near-infrared images. These cameras need to be calibrated if we want to use them for 3D reconstruction. We present a calibration procedure for simultaneously calibrating the cameras and extracting their relative pose. This enables us to construct colored meshes of the environment. Once the camera parameters of the infrared camera are known, the depth images can be used to perform the KinectFusion algorithm, which produces well-formed meshes of the environment by combining many depth frames taken from several camera poses.
The Kinect v2 uses a time-of-flight technology where phase shifts are extracted from amplitude-modulated infrared light signals produced by an emitter. The extracted phase shifts are then converted to depth values. However, the extraction of phase shifts includes a phase unwrapping procedure, which is sensitive to noise and can result in large depth errors.
By utilizing the ability to access the raw phase measurements from the device, we managed to modify the phase unwrapping procedure. The new procedure includes the extraction of several hypotheses for the unwrapped phase and a spatial propagation to select amongst them. The proposed method has been compared with the available drivers in the open source library libfreenect2 and the Microsoft Kinect SDK v2. Our experiments show that the depth images of the two available drivers have similar quality and that our proposed method improves over libfreenect2. The calculations in the proposed method are more expensive than those in libfreenect2, but it still runs at 2.5× real time. However, contrary to libfreenect2, the proposed method lacks a filter that removes outliers from the depth images. This turned out to be an important feature when performing KinectFusion, and future work should thus focus on adding an outlier filter.
@mastersthesis{diva2:854680,
author = {Järemo Lawin, Felix},
title = {{Depth Data Processing and 3D Reconstruction Using the Kinect v2}},
school = {Linköping University},
type = {{LiTH-ISY-EX--15/4884--SE}},
year = {2015},
address = {Sweden},
}
In a synchronized multi camera system it is imperative that the synchronization error between the different cameras is as close to zero as possible and that the jitter of the presumed frame rate is as small as possible. This is even more important when such systems are used in an autonomous vehicle trying to sense its surroundings. We would never hand over control to an autonomous vehicle if we could not trust the data it uses for moving around.
The purpose of this thesis was to build a synchronization setup for a multi camera system using state-of-the-art RayTrix digital cameras that will be used in the iQMatic project involving autonomous heavy-duty vehicles. The iQMatic project is a collaboration between several Swedish industrial partners and universities. The work also involved software development for the multi camera system. Two different synchronization techniques were implemented and then analysed against the system requirements: a hardware trigger, i.e. an external trigger using a microcontroller, and a software trigger using the API of the digital cameras.
Experiments were conducted by testing the different trigger modes with the developed multi camera software. The conclusions show that the hardware trigger is preferable in this particular system, as it exhibits more stability and better statistics against the system requirements than the software trigger. The thesis also shows that additional experiments are needed for a more accurate analysis.
@mastersthesis{diva2:822340,
author = {Vibeck, Alexander},
title = {{Synchronization of a Multi Camera System}},
school = {Linköping University},
type = {{LiTH-ISY-EX-ET--15/0438--SE}},
year = {2015},
address = {Sweden},
}
Autonomous driving or self-driving vehicles are concepts of vehicles knowing their environment and making driving manoeuvres without instructions from a driver. The concepts have been around for decades but have improved significantly in recent years as research in this area has made substantial progress. Benefits of autonomous driving include the possibility to decrease the number of traffic accidents and thereby save lives.
A major challenge in autonomous driving is to acquire 3D information about, and relations between, all objects in surrounding traffic. This is referred to as spatial perception. Stereo camera systems have become a central sensor module for advanced driver assistance systems and autonomous driving. For object detection and measurements at large distances, stereo vision encounters difficulties, including objects being small, having low contrast, and the presence of image noise. Having an accurate perception of the environment at large distances is, however, of high interest for many applications, especially autonomous driving.
This thesis proposes a method which tries to increase the range at which generic objects are first detected using a given stereo camera setup. Objects are represented by planes in 3D space. The input image is segmented into the various objects and the 3D plane parameters are estimated jointly, directly from the stereo image pairs. In particular, this thesis investigates methods to introduce geometric constraints into the segmentation or labeling task, i.e. assigning each considered pixel in the image to a plane.
The methods provided in this thesis show that, despite the difficulties at large distances, it is possible to exploit planar primitives in 3D space for obstacle detection at distances where other methods fail.
@mastersthesis{diva2:778457,
author = {Hillgren, Patrik},
title = {{Geometric Scene Labeling for Long-Range Obstacle Detection}},
school = {Linköping University},
type = {{LiTH-ISY-EX--14/4819--SE}},
year = {2015},
address = {Sweden},
}
In a time when cattle herds grow continually larger, the need for automatic methods to detect diseases is ever increasing. One possible method to discover diseases is to use thermal images and automatic head and eye detectors. In this thesis an eye detector and a head detector are implemented using the Random Forests classifier. During the implementation the classifier is evaluated using three different descriptors: Histogram of Oriented Gradients, Local Binary Patterns, and a descriptor based on pixel differences. An alternative classifier, the Support Vector Machine, is also evaluated for comparison against Random Forests.
The thesis results show that Histogram of Oriented Gradients performs well as a description of cattle heads, while Local Binary Patterns performs well as a description of cattle eyes. The provided descriptor performs almost equally well in both cases. The results also show that Random Forests performs approximately as well as the Support Vector Machine, when the Support Vector Machine is paired with Local Binary Patterns for both heads and eyes.
Finally, the thesis results indicate that it is easier to detect and locate cattle heads than cattle eyes. For eyes, combining a head detector and an eye detector is shown to give a better result than using an eye detector alone. In this combination, heads are first detected in the images, after which the eye detector is applied to the areas classified as heads.
@mastersthesis{diva2:856339,
author = {Sandsveden, Daniel},
title = {{Evaluation of Random Forests for Detection and Localization of Cattle Eyes}},
school = {Linköping University},
type = {{LiTH-ISY-EX--15/4885--SE}},
year = {2015},
address = {Sweden},
}
In the steel industry, laser triangulation based measurement systems can be utilized for evaluating the flatness of steel products. Shapeline is a company in Linköping that manufactures such measurement systems. This thesis work presents a series of experiments on a Shapeline measurement system in a relatively untested environment, the hot rolling mill at SSAB in Borlänge.
The purpose of this work is to evaluate how the conditions at a hot rolling mill affect the measurement performance. It was anticipated that measuring in a high-temperature environment would introduce difficulties that do not exist when measuring in cold environments. A number of different experiments were conducted, where equipment such as the laser and the camera bandpass filter were alternated. Via the experiments, information about noise due to the environment in the hot rolling mill was gained. The most significant noise was caused by heat shimmering. Using the presented methods, the magnitude and frequency spectrum of the heat shimmering noise could be determined. The results also indicate that heat shimmering causes large errors and is quite troublesome to counter. In addition to this, the quality of the line detections under the hot rolling mill circumstances was examined. It could be observed that the line detections did not introduce any significant errors despite the harsh conditions.
@mastersthesis{diva2:857691,
author = {Larsson, Oliver},
title = {{Evaluation of Flatness Gauge for Hot Rolling Mills}},
school = {Linköping University},
type = {{LiTH-ISY-EX--15/4894--SE}},
year = {2015},
address = {Sweden},
}
The car has become increasingly intelligent throughout the years. Today's radar and vision based safety systems can warn a driver and brake the vehicle automatically if obstacles are detected. Research projects such as the Google Car have even succeeded in creating fully autonomous cars.
The demands to obtain the highest rating in safety tests such as Euro NCAP are also steadily increasing, and as a result, the development of these systems has become more attractive for car manufacturers. In the near future, a car must have a system for detecting pedestrians and performing automatic braking in order to receive the highest safety rating of five stars. The prospect is that the volume of active safety systems will increase drastically when car manufacturers start installing them not only in luxury cars, but also in regularly priced ones. The use of automatic braking comes with high demands on the performance of active safety systems; false positives must be avoided at all costs.
Dollar et al. [2014] introduced Aggregated Channel Features (ACF), which is based on a 10-channel LUV+HOG feature map. The method uses decision trees learned by boosting and has been shown to outperform previous algorithms in object detection tasks. The rediscovery of neural networks, and especially Convolutional Neural Networks (CNN), has increased the performance in almost every field of machine learning, including pedestrian detection. Recently, Yang et al. [2015] combined the two approaches by using the feature maps from a CNN as input to a decision tree based boosting framework. This resulted in state-of-the-art performance on the challenging Caltech pedestrian data set.
This thesis presents an approach to improve the performance of a cascade of boosted classifiers by investigating the impact of using color information for pedestrian detection. The color self-similarity feature introduced by Walk et al. [2010] was used to create a version better adapted for boosting. This feature is then used in combination with a gradient based feature at the last step of a cascade.
The presented feature increases the performance compared to the classifiers currently used at Autoliv, both on data recorded by Autoliv and on the benchmark Caltech pedestrian data set.
@mastersthesis{diva2:867888,
author = {Hansson, Niklas},
title = {{Color Features for Boosted Pedestrian Detection}},
school = {Linköping University},
type = {{LiTH-ISY-EX--15/4899--SE}},
year = {2015},
address = {Sweden},
}
In the field of industrial automation, large savings can be realized if the position and orientation of an object are known. Knowledge about an object's position and orientation can be used by advanced robotic systems to work with complex items. Specifically, 2D objects are a big enough sub-domain to motivate special attention. Traditionally this problem has been solved with large mechanical systems that force the objects into specific configurations. Besides being expensive, taking up a lot of space, and having great difficulty handling fragile items, these mechanical systems have to be constructed for each particular type of object. This thesis explores the possibility of using registration algorithms from computer vision, based on 3D data, to find flat objects. While systems for locating 3D objects already exist, they have issues with locating essentially flat objects, since their positioning is mostly a function of their contour. The thesis consists of a brief examination of 2D algorithms and their extension to 3D, as well as results from the most suitable algorithm.
@mastersthesis{diva2:821158,
author = {Ingberg, Benjamin},
title = {{Registration of 2D Objects in 3D data}},
school = {Linköping University},
type = {{LiTH-ISY-EX--15/4848--SE}},
year = {2015},
address = {Sweden},
}
In this thesis a system for creating panoramic video has been developed. The panoramic video is formed by stitching several camera streams together. The system is designed as a vehicle-mounted system, but can be applied to several other areas, such as surveillance. The system creates the video by finding features that correspond in the overlapping frames. By using cylinder projection, the problem is reduced to finding a translation between the images, and using algorithms such as ORB, matching features can be detected and described. The camera frames are stitched together by calculating the average translation of the matching features. To reduce artifacts such as ghosting, a simple but effective alpha blending technique has been used. The system has been implemented using C++ and the OpenCV library, and the algorithm is capable of processing about 15 frames per second, making it close to real-time. With future improvements, such as parallel processing of the cameras, the system may be sped up even further and possibly include other types of image processing, e.g. object recognition and tracking.
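The alpha-blending step mentioned above can be sketched as a linear feather across the overlap region of two horizontally adjacent frames. Translation-only alignment is assumed, as after cylinder projection; the thesis implementation is in C++/OpenCV, and Python is used here only for illustration.

    # Minimal sketch: linear alpha blending across a horizontal overlap.
    import numpy as np

    def blend_horizontal(left, right, overlap):
        alpha = np.linspace(1.0, 0.0, overlap)[None, :, None]
        mixed = left[:, -overlap:] * alpha + right[:, :overlap] * (1 - alpha)
        return np.concatenate([left[:, :-overlap], mixed, right[:, overlap:]],
                              axis=1)

    a = np.random.rand(100, 200, 3)                 # stand-in camera frames
    b = np.random.rand(100, 200, 3)
    pano = blend_horizontal(a, b, overlap=40)       # (100, 360, 3) strip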
@mastersthesis{diva2:822602,
author = {Rydholm, Niklas},
title = {{Panoramic Video Stitching}},
school = {Linköping University},
type = {{LiTH-ISY-EX--15/4858--SE}},
year = {2015},
address = {Sweden},
}
Anomaly detection is a general theory of detecting unusual patterns or events in data. This master thesis investigates the subject of anomaly detection in two different applications. The first application is product inspection using a camera and the second application is surveillance using a 2D laser scanner.
The first part of the thesis presents a system for automatic visual defect inspection. The system is based on aligning the images of the product to a common template and doing pixel-wise comparisons. The system is trained using only images of products that are defined as normal, i.e. non-defective products. The visual properties of the inspected products are modelled using three different methods. The performance of the system and the different methods have been evaluated on four different datasets.
The second part of the thesis presents a surveillance system based on a single laser range scanner. The system is able to detect certain anomalous events based on the time, position and velocities of individual objects in the scene. The practical usefulness of the system is made plausible by a qualitative evaluation using unlabelled data.
@mastersthesis{diva2:855502,
author = {Thulin, Peter},
title = {{Anomaly Detection for Product Inspection and Surveillance Applications}},
school = {Linköping University},
type = {{LiTH-ISY-EX--15/4889--SE}},
year = {2015},
address = {Sweden},
}
Integrated camera systems for increasing safety and maneuverability are becoming increasingly common for heavy vehicles. One problem with heavy vehicles today is that there are blind spots where the driver has no or very little view. There is a great demand for increasing safety and helping the driver get a better view of the surroundings. This can be achieved by a sophisticated camera system, using cameras with a wide field of view, that covers dangerous blind spots.
This master thesis aims to investigate and develop a prototype solution for a camera system consisting of two fisheye cameras. The solution covers both hardware choices and software development, including camera calibration and image stitching. Two different fisheye camera calibration toolboxes are compared and their results discussed, with the aim of finding the most suitable one for this application. The results from the two toolboxes differ in performance, and the result from only one of the toolboxes is sufficient for image stitching.
@mastersthesis{diva2:854521,
author = {Söderroos, Anna},
title = {{Fisheye Camera Calibration and Image Stitching for Automotive Applications}},
school = {Linköping University},
type = {{LiTH-ISY-EX--15/4887--SE}},
year = {2015},
address = {Sweden},
}
Generic visual tracking is one of the classical problems in computer vision. In this problem, no prior knowledge of the target is available aside from a bounding box in the initial frame of the sequence. Generic visual tracking is a difficult task due to a number of factors such as momentary occlusions, target rotations, changes in target illumination, and variations in target size. In recent years, discriminative correlation filter (DCF) based trackers have shown promising results for visual tracking. These DCF based methods use the Fourier transform to efficiently calculate detection and model updates, allowing significantly higher frame rates than competing methods. However, existing DCF based methods only estimate the translation of the object, ignoring changes in size. This thesis investigates the problem of accurately estimating scale variations within a DCF based framework. A novel scale estimation method is proposed by explicitly constructing separate translation and scale filters. The proposed scale estimation technique is robust, significantly improves tracking performance, and operates in real time. In addition, a comprehensive evaluation of feature representations in a DCF framework is performed. Experiments are performed on the benchmark OTB-2015 dataset, as well as the VOT 2014 dataset. The proposed methods are shown to significantly improve the performance of existing DCF based trackers.
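At the core of a DCF tracker is a filter learned in the Fourier domain against a Gaussian target response; the scale filter proposed above reuses this machinery over a scale pyramid. The sketch below shows a single-channel MOSSE-style filter, a simplification of the multi-channel filters used in the thesis.

    # Minimal sketch: single-channel correlation filter in the Fourier domain.
    import numpy as np

    def train_dcf(patch, sigma=2.0, lam=1e-2):
        h, w = patch.shape
        yy, xx = np.mgrid[0:h, 0:w]
        g = np.exp(-((yy - h // 2) ** 2 + (xx - w // 2) ** 2) / (2 * sigma ** 2))
        G = np.fft.fft2(np.fft.ifftshift(g))   # desired response, peak at (0, 0)
        F = np.fft.fft2(patch)
        return G * np.conj(F) / (F * np.conj(F) + lam)   # regularized filter

    def detect(H, patch):
        resp = np.real(np.fft.ifft2(H * np.fft.fft2(patch)))
        return np.unravel_index(resp.argmax(), resp.shape)

    target = np.random.rand(64, 64)
    H = train_dcf(target)
    print(detect(H, target))   # peak near (0, 0) for the training patch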
@mastersthesis{diva2:910736,
author = {Häger, Gustav},
title = {{Improving Discriminative Correlation Filters for Visual Tracking}},
school = {Linköping University},
type = {{LiTH-ISY-EX-15/4919--SE}},
year = {2015},
address = {Sweden},
}
Pedestrian detection is an important field with applications in active safety systems for cars as well as autonomous driving. Since autonomous driving and active safety are becoming technically feasible, the interest in these applications has increased dramatically. The aim of this thesis is to investigate convolutional neural networks (CNN) for pedestrian detection. The reason for this is that CNN have recently been successfully applied to several different computer vision problems. The main applications of pedestrian detection are in real-time systems. For this reason, this thesis investigates strategies for reducing the computational complexity of forward propagation for CNN. The approach used in this thesis for extracting pedestrians is to use a CNN to find a probability map of where pedestrians are located. From this probability map, bounding boxes for pedestrians are generated. A method for handling scale invariance for the objects of interest has also been developed in this thesis. Experiments show that using this method gives significantly better results for the problem of pedestrian detection. The accuracy which this thesis has managed to achieve is similar to the accuracy of some other works which use CNN.
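Going from probability map to bounding boxes can be as simple as thresholding followed by connected-component analysis. A minimal sketch (the threshold, minimum area and grouping scheme are assumptions; the thesis may generate boxes differently):

import numpy as np
from scipy import ndimage

def boxes_from_probability_map(prob, threshold=0.5, min_area=50):
    # Threshold the CNN output and group pixels into connected components.
    labels, _ = ndimage.label(prob > threshold)
    boxes = []
    for sl in ndimage.find_objects(labels):
        ys, xs = sl
        if (ys.stop - ys.start) * (xs.stop - xs.start) >= min_area:
            boxes.append((xs.start, ys.start, xs.stop, ys.stop))  # (x0, y0, x1, y1)
    return boxes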
@mastersthesis{diva2:839692,
author = {Molin, David},
title = {{Pedestrian Detection Using Convolutional Neural Networks}},
school = {Linköping University},
type = {{LiTH-ISY-EX--15/4855--SE}},
year = {2015},
address = {Sweden},
}
Machine learning can be utilized in many different ways in the field of automatic manufacturing and logistics. In this thesis, supervised machine learning has been used to train classifiers for detection and recognition of objects in images. The techniques AdaBoost and Random forest have been examined; both are based on decision trees.
The thesis has considered two applications: barcode detection and optical character recognition (OCR). Supervised machine learning methods are highly appropriate in both applications since both barcodes and printed characters are generally quite distinctive.
The first part of this thesis examines the use of machine learning for barcode detection in images, both traditional 1D barcodes and the more recent Maxi-codes, a type of two-dimensional barcode. In this part the focus has been on training classifiers with AdaBoost. The Maxi-code detection is mainly done with local binary pattern features. For detection of 1D codes, features are calculated from the structure tensor. The classifiers have been evaluated on around 200 real test images containing barcodes and show promising results.
The second part of the thesis involves optical character recognition. The focus in this part has been on training a Random forest classifier using point pair features. The performance has also been compared with the more proven and widely used Haar features. The results show that Haar features are superior in terms of accuracy; nevertheless, the conclusion is that point pairs can be used as features for Random forest in OCR.
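Point pair features admit a very compact implementation. The sketch below assumes one simple variant, comparing the intensities of randomly sampled pixel pairs inside a fixed-size character window; the exact feature definition used in the thesis may differ.

import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)

def make_point_pairs(n_pairs, h, w):
    # Sample random pixel pairs (y1, x1, y2, x2) inside an h-by-w window.
    return rng.integers(0, [h, w, h, w], size=(n_pairs, 4))

def point_pair_features(images, pairs):
    # Binary feature per pair: is pixel (y1, x1) brighter than pixel (y2, x2)?
    return np.stack([img[pairs[:, 0], pairs[:, 1]] > img[pairs[:, 2], pairs[:, 3]]
                     for img in images]).astype(np.uint8)

# pairs = make_point_pairs(256, 24, 16)   # hypothetical window size
# clf = RandomForestClassifier(n_estimators=100)
# clf.fit(point_pair_features(train_images, pairs), train_labels)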
@mastersthesis{diva2:822575,
author = {Fridolfsson, Olle},
title = {{Machine Learning:
for Barcode Detection and OCR}},
school = {Linköping University},
type = {{LiTH-ISY-Ex--15/4842--SE}},
year = {2015},
address = {Sweden},
}
A classic computer vision task is the estimation of a 3D map from a collection of images. This thesis explores the online simultaneous estimation of camera poses and map points, often called visual simultaneous localisation and mapping (VSLAM). In the near future the use of visual information by autonomous cars is likely, since driving is a vision-dominated process. For example, VSLAM could be used to estimate the position of the car in relation to objects of interest, such as the road, other cars and pedestrians. Aimed at the creation of a real-time, robust, loop-closing, single-camera SLAM system, the properties of several state-of-the-art VSLAM systems and related techniques are studied. The system goals cover several important, if difficult, problems, which makes a solution widely applicable. This thesis makes two contributions: a rigorous qualitative analysis of VSLAM methods and a system designed accordingly. A novel tracking-by-matching scheme is proposed, which, unlike the trackers used by many similar systems, is better able to deal with forward camera motion. The system estimates general motion with loop closure in real time. The system is compared to a state-of-the-art monocular VSLAM algorithm and found to be similar in speed and performance.
@mastersthesis{diva2:771912,
author = {Persson, Mikael},
title = {{Online Monocular SLAM:
Rittums}},
school = {Linköping University},
type = {{Lith-ISY-EX--13/4741-SE}},
year = {2014},
address = {Sweden},
}
The interest in using GPUs as general processing units for heavy computations (GPGPU) has increased in the last couple of years. Manufacturers such as Nvidia and AMD make GPUs powerful enough to outperform CPUs by an order of magnitude for suitable algorithms. For embedded systems, GPUs are not as popular yet. The embedded GPUs available on the market have often not been able to justify hardware changes from the current systems (CPUs and FPGAs) to systems using embedded GPUs: they have been too hard to get, too energy consuming and not suitable for some algorithms. At SICK IVP, advanced computer vision algorithms run on FPGAs. This master thesis optimizes two such algorithms for embedded GPUs and evaluates the result. It also evaluates the status of the embedded GPUs on the market today. The results indicate that embedded GPUs perform well enough to run the evaluated algorithms as fast as needed. The implementations are also easy to understand compared to implementations for FPGAs, which are the competing hardware.
@mastersthesis{diva2:768419,
author = {Nilsson, Mattias},
title = {{Evaluation of Computer Vision Algorithms Optimized for Embedded GPU:s.}},
school = {Linköping University},
type = {{LiTH-ISY-EX--14/4816--SE}},
year = {2014},
address = {Sweden},
}
The usage of 3D modeling is expanding rapidly. Modeling from aerial imagery has become very popular due to its increasing number of both civilian and military applications like urban planning, navigation and target acquisition.
This master thesis project was carried out at Vricon Systems at SAAB. The Vricon system produces high resolution geospatial 3D data based on aerial imagery from manned aircrafts, unmanned aerial vehicles (UAV) and satellites.
The aim of this work was to investigate to what degree superpixel segmentation and supervised learning can be applied to a terrain classification problem using imagery and digital surface models (DSM). The aim was also to investigate how the height information from the digital surface model may contribute compared to the information from the grayscale values. The goal was to identify buildings, trees and ground. Another task was to evaluate existing methods and compare results.
The approach for solving the stated goal was divided into several parts. First, the image was segmented using superpixel segmentation; after that, features were extracted. Then the classifiers were created and trained, and finally the classifiers were evaluated.
The classification method that obtained the best results in this thesis had approximately 90 % correctly labeled superpixels. The result was equal to, if not better than, other solutions available on the market.
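The superpixel-plus-classifier pipeline is straightforward to prototype. A minimal sketch using SLIC superpixels and a Random forest; the choice of SLIC, the per-superpixel statistics and all parameter values are illustrative, not the thesis's exact setup (requires scikit-image >= 0.19 for the channel_axis argument):

import numpy as np
from skimage.segmentation import slic
from sklearn.ensemble import RandomForestClassifier

def superpixel_features(intensity, dsm, n_segments=500):
    # Segment the grayscale image and compute simple per-superpixel statistics.
    segments = slic(intensity, n_segments=n_segments, compactness=10,
                    channel_axis=None)
    feats = []
    for label in np.unique(segments):
        mask = segments == label
        feats.append([intensity[mask].mean(),   # appearance
                      dsm[mask].mean(),         # absolute height
                      dsm[mask].std()])         # height variation (trees vs. roofs)
    return segments, np.array(feats)

# segments, X = superpixel_features(gray_image, surface_model)
# clf = RandomForestClassifier().fit(X_train, y_train)  # labels: building/tree/ground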
@mastersthesis{diva2:767120,
author = {Ringqvist, Sanna},
title = {{Classification of terrain using superpixel segmentation and supervised learning}},
school = {Linköping University},
type = {{LiTH-ISY-EX--14/4752--SE}},
year = {2014},
address = {Sweden},
}
High resolution 3D images are of high interest in military operations, where data can be used to classify and identify targets. The Swedish defence research agency (FOI) is interested in the latest research and technologies in this area. A drawback with normal 3D laser systems is the lack of high resolution for long range measurements. One technique for high long-range-resolution laser radar is based on time correlated single photon counting (TCSPC). By repetitively sending out short laser pulses and measuring the time of flight (TOF) of single reflected photons, extremely accurate range measurements can be done. A drawback with this method is that it is hard to create single photon detectors with many pixels and high temporal resolution, hence a single detector is used. Scanning an entire scene with one detector is very time consuming; instead, which is the topic of this thesis, the entire scene can be measured with fewer measurements than the number of pixels. To do this a technique called compressed sensing (CS) is introduced. CS exploits the fact that signals normally are compressible and can be represented sparsely in some basis. CS sets other requirements on the sampling than the classical Shannon-Nyquist sampling theorem. With a digital micromirror device (DMD), linear combinations of the scene can be reflected onto the single photon detector, creating scalar intensity values as measurements. This means that fewer DMD patterns than the number of pixels can reconstruct the entire 3D scene. In this thesis a computer model of the laser system helps to evaluate different CS reconstruction methods with different scenarios of the laser system and the scene. The results show how many measurements are required to reconstruct scenes properly and how the DMD patterns affect the results. CS proves to enable a great reduction, 85-95 %, of the required measurements compared to a pixel-by-pixel scanning system. Total variation minimization proves to be the best choice of reconstruction method.
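The thesis finds total variation minimization to be the best reconstruction method; as a simpler illustration of CS recovery, the sketch below solves the related l1-regularized least-squares problem with ISTA. The matrix A stands in for the stack of DMD patterns, y for the scalar detector readings; lam and the iteration count are illustrative.

import numpy as np

def ista(A, y, lam=0.1, n_iter=200):
    # Iterative shrinkage-thresholding for min ||Ax - y||^2 + lam * ||x||_1.
    L = np.linalg.norm(A, 2) ** 2        # Lipschitz constant of the gradient
    x = np.zeros(A.shape[1])
    for _ in range(n_iter):
        grad = A.T @ (A @ x - y)
        z = x - grad / L
        x = np.sign(z) * np.maximum(np.abs(z) - lam / L, 0.0)  # soft threshold
    return x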
@mastersthesis{diva2:722826,
author = {Fall, Erik},
title = {{Compressed Sensing for 3D Laser Radar}},
school = {Linköping University},
type = {{LiTH-ISY-EX---14/4767---SE}},
year = {2014},
address = {Sweden},
}
A lane position system and enhancement techniques for increasing the robustness and availability of such a system are investigated. The enhancements are performed by using additional sensor sources like map data and GPS. The thesis contains a description of the system, two models of the system and two implemented filters for the system. The thesis also contains conclusions and results of theoretical and experimental tests of the increased robustness and availability of the system. The system can be integrated with an existing system that investigates driver behavior, developed for fatigue detection. That system was developed in a project named Drowsi, where among others Volvo Technology participated.
@mastersthesis{diva2:749036,
author = {Landberg, Markus},
title = {{Enhancement Techniques for Lane Position Adaptation (Estimation) using GPS- and Map Data}},
school = {Linköping University},
type = {{LiTH-ISY-EX--14/4788--SE}},
year = {2014},
address = {Sweden},
}
In recent years several depth cameras have emerged on the consumer market, creating many interesting possibilities for both professional and recreational usage. One example of such a camera is the Microsoft Kinect sensor, originally used with the Microsoft Xbox 360 game console. In this master thesis a system is presented that utilizes this device in order to create as accurate a 3D reconstruction of an indoor environment as possible. The major novelty of the presented system is the data structure based on signed distance fields and voxel octrees used to represent the observed environment.
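The core update of a signed-distance representation is a per-voxel weighted running average of truncated distances. A minimal dense-grid sketch; the thesis's actual contribution, the voxel-octree organization, is omitted here, and voxel centres are assumed to be already transformed into the camera frame:

import numpy as np

def integrate_depth(tsdf, weight, depth, K, voxel_xyz, trunc=0.05):
    # Fuse one depth map into flat TSDF/weight arrays (one entry per voxel).
    z = voxel_xyz[:, 2]
    front = z > 1e-6
    proj = (K @ voxel_xyz[front].T).T
    u = np.round(proj[:, 0] / proj[:, 2]).astype(int)
    v = np.round(proj[:, 1] / proj[:, 2]).astype(int)
    h, w = depth.shape
    inside = (u >= 0) & (u < w) & (v >= 0) & (v < h)
    idx = np.flatnonzero(front)[inside]
    d = depth[v[inside], u[inside]]
    idx = idx[d > 0]                     # ignore missing depth readings
    sdf = np.clip(d[d > 0] - z[idx], -trunc, trunc)
    tsdf[idx] = (tsdf[idx] * weight[idx] + sdf) / (weight[idx] + 1)
    weight[idx] += 1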
@mastersthesis{diva2:716061,
author = {Bengtsson, Morgan},
title = {{Indoor 3D Mapping using Kinect}},
school = {Linköping University},
type = {{LiTH-ISY-EX--14/4753--SE}},
year = {2014},
address = {Sweden},
}
The Next-Best-View (NBV) problem plays an important part in automatic 3D object reconstruction and exploration applications. This thesis presents a novel approach of ray-casting in Occupancy Grid Maps (OGM) in the context of solving the NBV problem in a 3D-exploration setting. The proposed approach utilizes the structure of an octree-based OGM to perform calculations of potential information gain. The computations are significantly faster than current methods, without decreasing mapping quality. Performance, both in terms of mapping quality, coverage and computational complexity, is experimentally verified through a comparison with existing state-of-the-art methods using high-resolution point cloud data generated using time-of-flight laser range scanners.
Current methods for viewpoint ranking focus heavily on either mapping performance or computation speed. The results presented in this thesis indicate that the proposed method is able to achieve a mapping performance similar to the performance-oriented approaches while maintaining the same low computational cost as the more approximate methods.
@mastersthesis{diva2:761834,
author = {Svensson, Martin},
title = {{Accelerated Volumetric Next-Best-View Planning in 3D Mapping}},
school = {Linköping University},
type = {{LiTH-ISY-EX--14/4801--SE}},
year = {2014},
address = {Sweden},
}
Many methods have been developed for visual tracking of generic objects. The vast majority of these assume the world is two-dimensional, either ignoring the third dimension or only dealing with it indirectly. This causes difficulties for the tracker when the target approaches or moves away from the camera, is occluded or moves out of the camera frame.
Unmanned aerial vehicles (UAVs) are increasingly used in civilian applications and some of these will undoubtedly carry tracking systems in the future. As they move around, these trackers will encounter both scale changes and occlusions. To improve the tracking performance in these cases, the third dimension should be taken into account.
This thesis extends the capabilities of a 2D tracker to three dimensions, with the assumption that the target moves on a ground plane.
The position of the tracker camera is established by matching the video it produces to a sparse point-cloud map built with off-the-shelf structure-from-motion software. A target is tracked with a generic 2D tracker and subsequently positioned on the ground. Should the target disappear from view, its motion on the ground is predicted. In combination, these simple techniques are shown to improve the robustness of a tracking system on a moving platform under target scale changes and occlusions.
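Positioning the target on the ground amounts to intersecting the camera ray through one pixel (for example the bottom midpoint of the 2D tracker's bounding box) with the ground plane. A minimal sketch; the pose convention stated in the comments is an assumption:

import numpy as np

def ground_position(K, R, t, u, v, plane_z=0.0):
    # K: 3x3 intrinsics; (R, t) such that x_cam = R @ x_world + t.
    ray_cam = np.linalg.inv(K) @ np.array([u, v, 1.0])
    ray_world = R.T @ ray_cam           # ray direction in world coordinates
    cam_center = -R.T @ t               # camera centre in world coordinates
    s = (plane_z - cam_center[2]) / ray_world[2]
    return cam_center + s * ray_world   # 3D point where the ray meets the ground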
@mastersthesis{diva2:761603,
author = {Robinson, Andreas},
title = {{Implementation and evaluation of a 3D tracker}},
school = {Linköping University},
type = {{LiTH-ISY-EX--14/4800--SE}},
year = {2014},
address = {Sweden},
}
In this thesis we study the problem of multi-session dense RGB-D SLAM for 3D reconstruction. Multi-session reconstruction can allow users to capture parts of an object that could not easily be captured in one session, due for instance to poor accessibility or user mistakes. We first present a thorough overview of single-session dense RGB-D SLAM and describe the multi-session problem as a loosening of the incremental camera movement and static scene assumptions commonly held in the single-session case. We then implement and evaluate several variations on a system for doing two-session reconstruction as an extension to a single-session dense RGB-D SLAM system.
The extension from one to several sessions is divided into registering separate sessions into a single reference frame, re-optimizing the camera trajectories, and fusing together the data to generate a final 3D model. Registration is done by matching reconstructed models from the separate sessions using one of two adaptations of a 3D object detection pipeline. The registration pipelines are evaluated with many different sub-steps on a challenging dataset, and it is found that robust registration can be achieved using the proposed methods on scenes without degenerate shape symmetry. In particular we find that using plane matches between two sessions as constraints for as much as possible of the registration pipeline improves results.
Several different strategies for re-optimizing camera trajectories using data from both sessions are implemented and evaluated. The re-optimization strategies are based on re-tracking the camera poses from all sessions together, and then optionally optimizing over the full problem as represented on a pose graph. The camera tracking is done by incrementally building and tracking against a TSDF volume, from which a final 3D mesh model is extracted. The whole system is qualitatively evaluated against a realistic dataset for multi-session reconstruction. It is concluded that the overall approach is successful in reconstructing objects from several sessions, but that other fine-grained registration methods would be required in order to achieve multi-session reconstructions that are indistinguishable from single-session results in terms of reconstruction quality.
@mastersthesis{diva2:772448,
author = {Widebäck West, Nikolaus},
title = {{Multiple Session 3D Reconstruction using RGB-D Cameras}},
school = {Linköping University},
type = {{LiTH-ISY-EX--14/4814--SE}},
year = {2014},
address = {Sweden},
}
Visual simultaneous localization and mapping (SLAM) as a field has been researched for ten years, but with recent advances in mobile performance visual SLAM is entering the consumer market in a completely new way. A visual SLAM system will however be sensitive to incautious use: severe motion, occlusion or surroundings poor in visual features may cause the system to fail temporarily. The procedure of recovering from such a failure is called relocalization. Together with two similar problems, localization (finding your position in an existing SLAM session) and loop closing (the online repair and refinement of the map in an active SLAM session), these can be grouped as visual location recognition (VLR).
This thesis presents novel results by combining the scalability of FabMap and the precision of 13th Lab's tracking, yielding high-precision VLR (+/- 10 cm) while maintaining above 99 % precision and 60 % recall for sessions containing thousands of images, all running purely on a normal mobile phone.
The applications of VLR are many. Indoors, where GPS is not functioning, VLR can still provide positional information and navigate you through big complexes like airports and museums. Outdoors, VLR can improve the precision of GPS tenfold yielding a new level of navigational experience. Virtual and augmented reality applications are other areas that benefit from improved positioning and localization.
@mastersthesis{diva2:767444,
author = {Sjöholm, Alexander},
title = {{Closing the Loop:
Mobile Visual Location Recognition}},
school = {Linköping University},
type = {{LiTH-ISY-EX--14/4813--SE}},
year = {2014},
address = {Sweden},
}
Computer vision is a rapidly growing, interdisciplinary research field whose applications take an increasingly prominent role in today's society. With an increased interest in computer vision, the need to control cameras connected to computer vision systems also grows.
At the Computer Vision Laboratory at Linköping University, the framework EDSDK++ has been developed to remotely control digital cameras manufactured by Canon Inc. The framework is very extensive and contains a large number of functions and settings. The system is therefore still largely untested. This thesis aims to develop a demonstrator for EDSDK++ in the form of a simple active vision system that uses real-time face detection to steer a camera tilt unit, and a camera mounted on the tilt unit, to follow, zoom in on and focus on a face or a group of faces. One requirement was that the OpenCV library be used for the face detection and that EDSDK++ be used to control the camera. In addition, an API for controlling the camera tilt unit was to be developed.
During the development work, different methods for face detection were investigated, among other things. To improve performance, multiple face detectors were used, scanning an image in parallel from different angles with the help of multithreading. Both experimental and theoretical approaches were taken to determine the parameters needed to control the camera and the camera tilt unit. The result of the work was a demonstrator that fulfilled all requirements.
@mastersthesis{diva2:722871,
author = {Karg\'{e}n, Rolf},
title = {{Utveckling av ett active vision system för demonstration av EDSDK++ i tillämpningar inom datorseende}},
school = {Linköping University},
type = {{LiTH-ISY-EX-ET--14/0419--SE}},
year = {2014},
address = {Sweden},
}
Recording a video sequence with a camera during movement often produces blurred results. This is mainly due to motion blur which is caused by rapid movement of objects in the scene or the camera during recording. By correcting for changes in the orientation of the camera, caused by e.g. uneven terrain, it is possible to minimize the motion blur and thus, produce a stabilized video.
In order to do this, data gathered from a gyroscope and the camera itself can be used to measure the orientation of the camera. The raw data needs to be processed, synchronized and filtered to produce a robust estimate of the orientation. This estimate can then be used as input to some automatic control system in order to correct for changes in the orientation.
This thesis focuses on examining the possibility of such a stabilization. The actual stabilization is left for future work. An evaluation of the hardware as well as the implemented methods is done with emphasis on speed, which is crucial in real-time computing.
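The thesis does not prescribe a particular fusion filter; one common lightweight choice for combining a fast but drifting gyroscope with slower absolute measurements is a complementary filter. A per-axis 1D sketch, assuming the two streams are already synchronized and resampled to a common rate (alpha is a tuning parameter):

import numpy as np

def fuse_orientation(gyro_rate, camera_angle, dt, alpha=0.98):
    # gyro_rate: angular rate [rad/s]; camera_angle: absolute angle estimates [rad].
    angle = np.zeros(len(gyro_rate))
    angle[0] = camera_angle[0]
    for k in range(1, len(gyro_rate)):
        predicted = angle[k - 1] + gyro_rate[k] * dt                  # fast but drifts
        angle[k] = alpha * predicted + (1 - alpha) * camera_angle[k]  # slow but stable
    return angle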
@mastersthesis{diva2:656064,
author = {Gratorp, Eric},
title = {{Evaluation of online hardware video stabilization on a moving platform}},
school = {Linköping University},
type = {{LiTH-ISY-EX--13/4723--SE}},
year = {2013},
address = {Sweden},
}
Modern day cars are often equipped with a vision system that collects information about the car and its surroundings. Camera calibration is extremely important in order to maintain high accuracy in automotive safety applications. The cameras are calibrated offline in the factory; however, the mounting of the camera may change slowly over time. If the angles of the actual mounting of the camera are known, compensation for the angles can be done in software. Therefore, online calibration is desirable.
This master's thesis describes how to dynamically calibrate the roll angle. Two different methods have been implemented and compared. The first detects vertical edges in the image, such as houses and lamp posts. The second method detects license plates on other cars in front of the camera in order to calculate the roll angle.
The two methods are evaluated and the results are discussed. The results of the two methods vary considerably, and the method that turned out to give the best results was the one that detects vertical edges.
@mastersthesis{diva2:630415,
author = {de Laval, Astrid},
title = {{Online Calibration of Camera Roll Angle}},
school = {Linköping University},
type = {{LiTH-ISY-EX--13/4688--SE}},
year = {2013},
address = {Sweden},
}
A laser triangulating camera system projects a laser line onto an object to create height curves on the object surface. By moving the object, height curves from different parts of the object can be observed and combined to produce a three-dimensional representation of the object. The calibration of such a camera system involves transforming received data to get real-world measurements instead of pixel-based measurements.
The calibration method presented in this thesis focuses specifically on small fields of view. The goal is to provide an easy-to-use and robust calibration method that can complement already existing calibration methods. The tool should get as good measurements in metric units as possible, while still keeping complexity and production costs of the calibration object low. The implementation uses only data from the laser plane itself, making it usable also in environments where no external light exists.
The proposed implementation utilises a complete scan of a three-dimensional calibration object and returns a calibration for three dimensions. The results of the calibration have been evaluated against synthetic and real data.
@mastersthesis{diva2:630377,
author = {Rydström, Daniel},
title = {{Calibration of Laser Triangulating Cameras in Small Fields of View}},
school = {Linköping University},
type = {{LiTH-ISY-EX--13/4669--SE}},
year = {2013},
address = {Sweden},
}
In certain industries, quality testing is crucial to make sure that the components being manufactured do not contain any defects. One method to detect these defects is to heat the specimen being inspected and then to study the cooling process using infrared thermography. The exploration of non-destructive testing using thermography is at an early stage, and therefore the purpose of this thesis is to analyse some of the existing techniques and to propose improvements.
A test specimen containing several different defects was designed specifically for this thesis. A flash lamp was used to heat the specimen and a high-speed infrared camera was used to study both the spatial and temporal features of the cooling process. An algorithm was implemented to detect anomalies and different parameter settings were evaluated. The results show that the proposed method is successful at finding the sought defects, and also outperforms one of the older methods.
@mastersthesis{diva2:610166,
author = {Höglund, Kristofer},
title = {{Non-destructive Testing Using Thermographic Image Processing}},
school = {Linköping University},
type = {{LiTH-ISY-EX--13/4655--SE}},
year = {2013},
address = {Sweden},
}
This thesis project has investigated whether it is possible to compare photographs of the seabed, taken with a camera mounted on SAAB Dynamics' vehicle AUV-62 (here called Sapphires), with SONAR images captured from the same vehicle on a different occasion. Objects imaged with a camera and with side-scan SONARs do not normally share visual appearance and are therefore difficult to compare. For this reason, the method chosen for comparing the camera and SONAR images is based not on the individual appearance of objects but on patterns created by several objects. Objects in the images are identified and described by a position in longitude and latitude and a radius. In the camera images, objects are identified by segmenting the images using MSER, where stones and other objects have an appearance deviating from the background of sand. In the SONAR image, areas containing objects are identified by looking for high-intensity echo responses, corresponding to objects that reflected the sound pulses well; the objects are then created by applying MSER to these areas. The two sets of objects, from the camera and SONAR images, are then compared by matching every object in the camera image against every object in the SONAR image: each pair is translated under the hypothesis that they are the same object, and the number of remaining objects consistent with that assumption is counted.
@mastersthesis{diva2:680896,
author = {Ekblad, Richard},
title = {{Korrelering mellan optiskt och akustiskt avbildade objekt på havsbotten}},
school = {Linköping University},
type = {{LiTH-ISY-EX--13/4742--SE}},
year = {2013},
address = {Sweden},
}
A fully automatic de-weathering system to increase the visibility and stability in surveillance applications during bad weather has been developed. Rain, snow and haze during daylight are handled in real time with acceleration from CUDA-implemented algorithms. Video from fixed cameras is processed on a PC with no need for special hardware except an NVidia GPU. The system does not use any background model and does not require any precalibration. An increase in contrast is obtained in all haze/rain/snow cases, while the system lags by at most one frame during rain or snow removal. De-hazing can be applied at any distance to simplify tracking or other algorithms operating on a surveillance system.
@mastersthesis{diva2:647937,
author = {Pettersson, Niklas},
title = {{GPU-Accelerated Real-Time Surveillance De-Weathering}},
school = {Linköping University},
type = {{LiTH-ISY-EX--13/4677--SE}},
year = {2013},
address = {Sweden},
}
In Sweden and many other northern countries, it is common for heat to be distributed to homes and industries through district heating networks. Such networks consist of pipes buried underground carrying hot water or steam with temperatures in the range of 90-150 °C. Due to bad insulation or cracks, heat or water leakages might appear.
A system for large-scale monitoring of district heating networks through remote thermography has been developed and is in use at the company Termisk Systemteknik AB. Infrared images are captured from an aircraft and analysed, finding and indicating the areas for which the ground temperature is higher than normal. During the analysis there are, however, many other warm areas than true water or energy leakages that are marked as detections. Objects or phenomena that can cause false alarms are those that, for some reason, are warmer than their surroundings, for example chimneys, cars and heat leakages from buildings.
During the last couple of years, the system has been used in a number of cities. Therefore, there exists a fair amount of examples of different types of detections. The purpose of the present master’s thesis is to evaluate the reduction of false alarms of the existing analysis that can be achieved with the use of a learning system, i.e. a system which can learn how to recognize different types of detections.
A labelled data set for training and testing was acquired by contact with customers. Furthermore, a number of features describing the intensity difference within the detection, its shape and propagation as well as proximity information were found, implemented and evaluated. Finally, four different classifiers and other methods for classification were evaluated.
The method that obtained the best results consists of two steps. In the initial step, all detections which lie on top of a building are removed from the data set of labelled detections. The second step consists of classification using a Random forest classifier. Using this two-step method, the number of false alarms is reduced by 43% while the percentage of water and energy detections correctly classified is 99%.
@mastersthesis{diva2:640093,
author = {Berg, Amanda},
title = {{Classification of leakage detections acquired by airborne thermography of district heating networks}},
school = {Linköping University},
type = {{LiTH-ISY-EX--13/4678--SE}},
year = {2013},
address = {Sweden},
}
In factory automation, cameras and image processing algorithms can be used to inspect objects. This can decrease the number of faulty objects that leave the factory and reduce the manual labour needed. A vision sensor is a system where camera and image processing are delivered together and that only needs to be configured for the application it is to be used for; thus no programming knowledge is needed by the customer. In this master's thesis a way to make the configuration of a vision sensor even easier is developed and evaluated.
The idea is that the customer knows his or her product much better than he or she knows image processing. The customer could take images of positive and negative samples of the object that is to be inspected. The algorithm should then, given these images, configure the vision sensor automatically.
The algorithm that is developed to solve this problem is described step by step with examples to illustrate the problems that needed to be solved. Much of the focus is on how to compare two configurations to each other, in order to find the best one. The resulting configuration from the algorithm is then evaluated with respect to types of applications, computation time and representativeness of the input images.
@mastersthesis{diva2:624443,
author = {Ollesson, Niklas},
title = {{Automatic Configuration of Vision Sensor}},
school = {Linköping University},
type = {{LiTH-ISY-EX--13/4666--SE}},
year = {2013},
address = {Sweden},
}
Identification of individuals has been solved with many different solutions around the world, either using biometric data or external means of verification such as id cards or RFID tags. The advantage of using biometric measurements is that they are directly tied to the individual and are usually unalterable. Acquiring dependable measurements is however challenging when the individuals are uncooperative. A dependable system should be able to deal with this and produce reliable identifications.
The system proposed in this thesis can autonomously classify uncooperative specimens from depth data. The data is acquired from a depth camera mounted in an uncontrolled environment, where it was allowed to continuously record for two weeks. This requires stable data extraction and normalization algorithms to produce good representations of the specimens. Robust descriptors can therefore be extracted from each sample of a specimen and, together with different classification algorithms, the system can be trained or validated. Even with as many as 138 different classes the system achieves high recognition rates. Inspired by the research field of face recognition, the best classification algorithm, the method of fisherfaces, was able to accurately recognize 99.6% of the validation samples, followed by two variations of the method of eigenfaces, achieving recognition rates of 98.8% and 97.9%. These results affirm that the capabilities of the system are adequate for a commercial implementation.
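The fisherfaces recipe, PCA for dimensionality reduction followed by LDA in the reduced space, is compact to express with standard tools. A minimal sketch, not the thesis's implementation; the PCA dimension cap of 150 is illustrative:

import numpy as np
from sklearn.decomposition import PCA
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.pipeline import make_pipeline

def fisherfaces(n_classes, n_samples):
    # PCA keeps the within-class scatter matrix non-singular before LDA.
    return make_pipeline(
        PCA(n_components=min(n_samples - n_classes, 150)),
        LinearDiscriminantAnalysis(),
    )

# X: (n_samples, n_pixels) flattened, normalized depth images; y: identity labels.
# model = fisherfaces(138, len(X_train)).fit(X_train, y_train)
# accuracy = model.score(X_val, y_val)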
@mastersthesis{diva2:635227,
author = {Björkeson, Felix},
title = {{Autonomous Morphometrics using Depth Cameras for Object Classification and Identification}},
school = {Linköping University},
type = {{LiTH-ISY-EX--13/4680--SE}},
year = {2013},
address = {Sweden},
}
In most cases today when a specific person's whereabouts are monitored through video surveillance, it is done manually, and his or her location when not seen is based on assumptions about how fast he or she can move. Since humans are good at recognizing people this can be done accurately, given good video data, but the time needed to go through all the data is extensive and therefore expensive. Because of rapid technical development, computers are getting cheaper to use and therefore more interesting to use for tedious work.
This thesis is a part of a larger project that aims to see to what extent it is possible to estimate a person of interest's time dependent 3D position, when seen in surveillance videos. The surveillance videos are recorded with non overlapping monocular cameras. Furthermore the project aims to see if the person of interest's movement, when position data is unavailable, could be predicted. The outcome of the project is a software capable of following a person of interest's movement with an error estimate visualized as an area indicating where the person of interest might be at a specific time.
The main focus of this thesis is to implement and evaluate a people detector meant to be used in the project, reduce noise in the position measurements, predict the position when the person of interest's location is unknown, and evaluate the complete project.
The project combines known methods in computer vision and signal processing, and the outcome is a software that can be used on a normal PC running a Windows operating system. The software implemented in the thesis uses a Hough transform based people detector and a Kalman filter for one-step-ahead prediction. The detector is evaluated with known methods such as miss rate vs. false positives per window or image (FPPW and FPPI respectively) and recall vs. 1-precision.
The results indicate that it is possible to estimate a person of interest's 3D position with single monocular cameras. It is also possible to follow the movement, to some extent, where position data are unavailable. However, the software needs more work in order to be robust enough to handle the diversity that may appear in different environments and to handle large-scale sensor networks.
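The one-step-ahead prediction can be sketched with a standard constant-velocity Kalman filter; the state, frame rate and noise levels below are assumptions, not the thesis's tuned values. When no detection is available, only the predict step runs, and the growing covariance P corresponds to the widening area in which the person might be.

import numpy as np

dt = 1.0 / 25.0                                    # assumed frame rate
F = np.array([[1, 0, dt, 0], [0, 1, 0, dt],
              [0, 0, 1, 0], [0, 0, 0, 1]], float)  # state: [x, y, vx, vy]
H = np.array([[1, 0, 0, 0], [0, 1, 0, 0]], float)  # we observe position only
Q = 0.01 * np.eye(4)                               # process noise (tuning)
R = 0.5 * np.eye(2)                                # measurement noise (tuning)

def kalman_step(x, P, z=None):
    # One-step-ahead prediction; update only when a detection z = [x, y] exists.
    x, P = F @ x, F @ P @ F.T + Q
    if z is not None:
        K = P @ H.T @ np.linalg.inv(H @ P @ H.T + R)
        x = x + K @ (z - H @ x)
        P = (np.eye(4) - K @ H) @ P
    return x, P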
@mastersthesis{diva2:652387,
author = {Markström, Johannes},
title = {{3D Position Estimation of a Person of Interest in Multiple Video Sequences:
People Detection}},
school = {Linköping University},
type = {{LiTH-ISY-EX--13/4721--SE}},
year = {2013},
address = {Sweden},
}
Because of the increase in the number of security cameras, there is more video footage available than a human could efficiently process. In combination with the fact that computers are getting more efficient, it is getting more and more interesting to solve the problem of detecting and recognizing people automatically.
Therefore a method is proposed for estimating the 3D path of a person of interest in multiple, non-overlapping, monocular cameras. This project is a collaboration between two master's theses. This thesis will focus on recognizing a person of interest from several possible candidates, as well as estimating the 3D position of a person and providing a graphical user interface for the system. The recognition of the person of interest includes keeping track of said person frame by frame, and identifying said person in video sequences where the person of interest has not been seen before.
The final product is able to both detect and recognize people in video, as well as estimate their 3D position relative to the camera. The product is modular and any part can be improved or changed completely without changing the rest of the product. This results in a highly versatile product which can be tailored for any given situation.
@mastersthesis{diva2:650889,
author = {Johansson, Victor},
title = {{3D Position Estimation of a Person of Interest in Multiple Video Sequences:
Person of Interest Recognition}},
school = {Linköping University},
type = {{LiTH-ISY-EX--13/4718--SE}},
year = {2013},
address = {Sweden},
}
Automatic tracking of an object of interest in a video sequence is a task that has been much researched. Difficulties include varying scale of the object, rotation, and object appearance changing over time, which lead to tracking failures. Tracking methods such as short-term tracking often fail if the object moves out of the camera's field of view or changes shape rapidly. Also, small inaccuracies in the tracking method can accumulate over time, which can lead to tracking drift. Long-term tracking is also problematic, partly due to updating and degradation of the object model, leading to incorrectly classified and tracked objects.
This master's thesis implements a long-term tracking framework called Tracking-Learning-Detection which can learn and adapt, using so-called P/N-learning, to changing object appearance over time, thus making it more robust to tracking failures. The framework consists of three parts: a tracking module which follows the object from frame to frame, a learning module that learns new appearances of the object, and a detection module which can detect learned appearances of the object and correct the tracking module if necessary.
This tracking framework is evaluated on thermal infrared videos and the results are compared to the results obtained from videos captured within the visible spectrum. Several important differences between visual and thermal infrared tracking are presented, and the effect these have on the tracking performance is evaluated.
In conclusion, the results are analyzed to evaluate which differences matter the most and how they affect tracking, and a number of different ways to improve the tracking are proposed.
@mastersthesis{diva2:627964,
author = {Stigson, Magnus},
title = {{Object Tracking Using Tracking-Learning-Detection inThermal Infrared Video}},
school = {Linköping University},
type = {{LiTH-ISY-EX--13/4668--SE}},
year = {2013},
address = {Sweden},
}
Visual tracking is a classical computer vision problem with many important applications in areas such as robotics, surveillance and driver assistance. The task is to follow a target in an image sequence. The target can be any object of interest, for example a human, a car or a football. Humans perform accurate visual tracking with little effort, while it remains a difficult computer vision problem. It imposes major challenges, such as appearance changes, occlusions and background clutter. Visual tracking is thus an open research topic, but significant progress has been made in the last few years.
The first part of this thesis explores generic tracking, where nothing is known about the target except for its initial location in the sequence. A specific family of generic trackers that exploit the FFT for faster tracking-by-detection is studied. Among these, the CSK tracker has recently been shown to obtain competitive performance at extraordinarily low computational cost. Three contributions are made to this type of trackers. Firstly, a new method for learning the target appearance is proposed and shown to outperform the original method. Secondly, different color descriptors are investigated for the tracking purpose. Evaluations show that the best descriptor greatly improves the tracking performance. Thirdly, an adaptive dimensionality reduction technique is proposed, which adaptively chooses the most important feature combinations to use. This technique significantly reduces the computational cost of the tracking task. Extensive evaluations show that the proposed tracker outperforms state-of-the-art methods in the literature, while operating at several times higher frame rate.
In the second part of this thesis, the proposed generic tracking method is applied to human tracking in surveillance applications. A causal framework is constructed, that automatically detects and tracks humans in the scene. The system fuses information from generic tracking and state-of-the-art object detection in a Bayesian filtering framework. In addition, the system incorporates the identification and tracking of specific human parts to achieve better robustness and performance. Tracking results are demonstrated on a real-world benchmark sequence.
@mastersthesis{diva2:709327,
author = {Danelljan, Martin},
title = {{Visual Tracking}},
school = {Linköping University},
type = {{LiTH-ISY-EX--13/4736--SE}},
year = {2013},
address = {Sweden},
}
Functional magnetic resonance imaging (fMRI) is one of the best techniques for neuroimaging and has revolutionized the way we understand brain function. It measures the changes in the blood oxygen level-dependent (BOLD) signal, which is related to neuronal activity. The complexity of the data, the presence of different types of noise and the massive amount of data make fMRI data analysis challenging, and it demands efficient signal processing and statistical analysis methods. The results of the analysis are used by physicians, neurologists and researchers for a better understanding of brain function.
The purpose of this study is to design a toolbox for fMRI data analysis. It includes methods to detect brain activity maps, estimate the hemodynamic response (HDR) and analyse the connectivity of brain structures. The toolbox provides methods for detection of activated brain regions measured with a Bayesian estimator. Results are compared with conventional methods such as the t-test, ordinary least squares (OLS) and weighted least squares (WLS). Brain activation and HDR are estimated with a linear adaptive model and a nonlinear method based on a radial basis function (RBF) neural network. A nonlinear autoregressive with exogenous inputs (NARX) neural network is developed to model the dynamics of the fMRI data. The toolbox also provides methods for brain connectivity analysis, such as functional connectivity and effective connectivity. These methods are examined on simulated and real fMRI datasets.
@mastersthesis{diva2:551505,
author = {Budde, Kiran Kumar},
title = {{A Matlab Toolbox for fMRI Data Analysis: Detection, Estimation and Brain Connectivity}},
school = {Linköping University},
type = {{LiTH-ISY-EX--12/4600--SE}},
year = {2012},
address = {Sweden},
}
The introduction of dual energy CT (DECT) in the field of medical healthcare has made it possible to extract more information from the scanned objects. This in turn has the potential to improve the accuracy in radiation therapy dose planning. One problem that remains before successful material decomposition can be achieved, however, is the presence of beam hardening and scatter artifacts that arise in a scan. Methods currently in clinical use for removal of beam hardening often bias the CT numbers. Hence, the possibility of an appropriate tissue decomposition is limited.
Here a method for successful decomposition as well as removal of the beam hardening artifact is presented. The method uses effective linear attenuations for five base materials (water, protein, adipose, cortical bone and marrow) to perform the decomposition on reconstructed simulated data. This is performed inside an iterative loop together with the polychromatic x-ray spectra to remove the beam hardening.
@mastersthesis{diva2:549562,
author = {Grandell, Oscar},
title = {{An iterative reconstruction algorithm for quantitative tissue decomposition using DECT}},
school = {Linköping University},
type = {{LiTH-ISY-EX--12/4617--SE}},
year = {2012},
address = {Sweden},
}
Autonomous vehicles have many application possibilities within many different fields, such as rescue missions, exploration of foreign environments and unmanned vehicles. For such a system to navigate in a safe manner, high requirements on reliability and security must be fulfilled.
This master's thesis explores the possibility of using a convolutional network on a robotic platform for autonomous path following. The only input used to predict the steering signal is a monochromatic image taken by a camera mounted on the robotic car, pointing in the steering direction. The convolutional network learns from demonstrations in a supervised manner.
In this thesis three different preprocessing options are evaluated. The evaluation is based on the quadratic error and the number of correctly predicted classes. The results show that the convolutional network has no problem learning a correct behaviour and scores good results when evaluated on data similar to what it has been trained on. The results also show that the evaluated preprocessing options are not enough to make the system independent of the environment.
@mastersthesis{diva2:534610,
author = {Schmiterlöw, Maria},
title = {{Autonomous Path Following Using Convolutional Networks}},
school = {Linköping University},
type = {{LiTH-ISY-EX--12/4577--SE}},
year = {2012},
address = {Sweden},
}
This is a master thesis of the Master of Science degree program in Applied Physics and Electrical Engineering at Linköping University. The goal of this thesis is to find out how the Microsoft Kinect can be used as a part of a camera rig to create accurate 3D models of an indoor environment. The Microsoft Kinect is marketed as a touch-free game controller for the Microsoft Xbox 360 game console. The Kinect contains a color camera and a depth camera. The depth camera works by constantly projecting a near infrared dot pattern that is observed with a near infrared camera. This thesis describes how to model the near infrared projector pattern to enable external near infrared cameras to be used to improve the measurement precision. The depth data that the Kinect outputs has been studied to determine what types of errors it contains. The finding was that the Kinect uses an online calibration algorithm that changes the depth data.
@mastersthesis{diva2:566581,
author = {Nordmark, Anton},
title = {{Kinect 3D Mapping}},
school = {Linköping University},
type = {{LiTH-ISY-EX--12/4636--SE}},
year = {2012},
address = {Sweden},
}
In this master thesis a visual odometry system is implemented and explained. Visual odometry is a technique that can be used on autonomous vehicles to determine their current position, and it is preferably used indoors where GPS does not work. The only input to the system is the images from a stereo camera, and the output is the current location given as a relative position.
In the C++ implementation, image features are found and matched between the stereo images and the previous stereo pair, which gives 150-250 verified feature matches. The image coordinates are triangulated into a 3D point cloud. The distance between two subsequent point clouds is minimized with respect to rigid transformations, which gives the motion described with six parameters, three for the translation and three for the rotation.
Noise in the image coordinates gives reconstruction errors which make the motion estimation very sensitive. The results from six experiments show that the weakness of the system is the ability to distinguish rotations from translations. However, if the system has additional knowledge of how it is moving, the minimization can be done with only three parameters and the system can estimate its position with less than 5 % error.
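With known correspondences, the rigid transformation minimizing the distance between two point clouds has a closed-form SVD solution, often called the Kabsch algorithm. A minimal sketch of that step (not necessarily the solver used in the thesis):

import numpy as np

def rigid_align(P, Q):
    # Least-squares rotation R and translation t such that R @ P + t ~ Q.
    # P, Q: (N, 3) matched 3D points from consecutive stereo frames.
    cP, cQ = P.mean(axis=0), Q.mean(axis=0)
    H = (P - cP).T @ (Q - cQ)                    # 3x3 cross-covariance
    U, _, Vt = np.linalg.svd(H)
    D = np.diag([1.0, 1.0, np.sign(np.linalg.det(Vt.T @ U.T))])
    R = Vt.T @ D @ U.T                           # proper rotation, det(R) = +1
    t = cQ - R @ cP
    return R, t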
@mastersthesis{diva2:550998,
author = {Johansson, Fredrik},
title = {{Visual Stereo Odometry for Indoor Positioning}},
school = {Linköping University},
type = {{LiTH-ISY-EX--12/4621--SE}},
year = {2012},
address = {Sweden},
}
This is a master thesis of the Master of Science degree program in Applied Physics and Electrical Engineering (Y) at Linköping University. The goal of the project is to develop an application for creating a map in real time from a video camera on a miniature unmanned aerial vehicle. This thesis project and report is a first exploratory study for this application. It implements a prototype method and evaluates it on sample sequences from an on-board video camera. The method first looks for good points to follow in the image and then tracks them in a sequence. The image is then pasted, or merged, together with previous images so that points from the different images align.
Two methods to find good points to follow are examined, with focus on real-time performance. The result is that the much faster FAST detector yielded results good enough to replace the slower, standard Harris-Stephens corner detector.
It is also examined whether it is possible to assume that the ground is a flat surface in this application, or if a computationally more expensive method estimating altitude information has to be used. The result is that at high altitudes, or when the ground is close to flat in reality and the camera points straight downwards, a two-dimensional method will do. If flying lower or with high objects in the picture, which is often the case in this application, it must be taken into account that the points really are at different heights; hence the ground cannot be assumed to be flat.
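The FAST-versus-Harris comparison is easy to reproduce with OpenCV's stock implementations. The sketch below only illustrates the kind of timing comparison made; the threshold and corner-count parameters are chosen arbitrarily.

import time
import cv2

def compare_detectors(gray):
    # Time FAST against Harris (via goodFeaturesToTrack) on one grayscale frame.
    fast = cv2.FastFeatureDetector_create(threshold=20)
    t0 = time.perf_counter()
    kp_fast = fast.detect(gray, None)
    t1 = time.perf_counter()
    corners = cv2.goodFeaturesToTrack(gray, maxCorners=500, qualityLevel=0.01,
                                      minDistance=5, useHarrisDetector=True)
    t2 = time.perf_counter()
    n_harris = 0 if corners is None else len(corners)
    print(f"FAST: {len(kp_fast)} points in {t1 - t0:.4f} s; "
          f"Harris: {n_harris} points in {t2 - t1:.4f} s")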
@mastersthesis{diva2:514063,
author = {Wolkesson, Henrik},
title = {{Realtime Mosaicing of Video Stream from $\mu$UAV}},
school = {Linköping University},
type = {{LiTH-ISY-EX--07/4140--SE}},
year = {2012},
address = {Sweden},
}
In today's industry 3D cameras are often used to inspect products. The camera produces both a 3D model and an intensity image by capturing a series of profiles of the object using laser triangulation. In many of these setups a physical encoder is attached to, for example, the conveyor belt that the product is travelling on. The encoder is used to get an accurate reading of the speed that the product has when it passes through the laser. Without this, the output image from the camera can be distorted due to a variation in velocity.
In this master thesis a method for integrating the functionality of this physical encoder into the software of the camera is proposed. The object is scanned together with a pattern; with the help of this pattern, the object can be restored to its original proportions.
@mastersthesis{diva2:455669,
author = {Josefsson, Mattias},
title = {{3D camera with built-in velocity measurement}},
school = {Linköping University},
type = {{LiTH-ISY-EX--11/4523--SE}},
year = {2011},
address = {Sweden},
}
When patients move during an MRI examination, severe artifacts arise in the reconstructed image, and motion correction is therefore often desired. An in-plane motion correction algorithm suitable for PRESTO-CAN, a new 3D functional MRI method where the sampling of k-space is radial in the kx- and kz-directions and Cartesian in the ky-direction, was implemented in this thesis work.
Rotation and translation movements can be estimated and corrected for separately, since the magnitude of the data is only affected by the rotation. The data were sampled in a radial pattern and the rotation was estimated by finding the translation in the angular direction using circular correlation. Correlation was also used when finding the translations in the x- and z-directions.
The motion correction algorithm was evaluated on computer simulated data, the motion was detected and corrected for, and this resulted in images with greatly reduced artifacts due to patient movements.
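The circular correlation at the core of the rotation estimate is efficiently computed via the FFT: the correlation peak gives the cyclic shift between two angular profiles, which corresponds to the rotation. A minimal sketch of that building block, not the full correction pipeline:

import numpy as np

def circular_shift_estimate(a, b):
    # Estimate the cyclic shift s such that a[n] ~ b[n - s], via FFT correlation.
    corr = np.fft.ifft(np.fft.fft(a) * np.conj(np.fft.fft(b)))
    shift = int(np.argmax(np.abs(corr)))
    n = len(a)
    return shift if shift <= n // 2 else shift - n  # signed shift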
@mastersthesis{diva2:456354,
author = {Karlsson, Anette},
title = {{In-Plane Motion Correction in Reconstruction of non-Cartesian 3D-functional MRI}},
school = {Linköping University},
type = {{LiTH-ISY-EX--11/4480--SE}},
year = {2011},
address = {Sweden},
}
This report has investigated the possibility of automatically identifying ditches from airborne LiDAR data. The chosen identification method first creates a height image from the LiDAR data. It then extracts ditch candidates by vectorizing the result of a line detection. The properties of the ditch candidates are then computed through an analysis of height profiles for each individual candidate, where the height profiles are created from the original data. By filtering the candidates according to their properties, ditch maps with user-specified ditch dimensions can be presented in a vector format that facilitates further use. The report describes how the algorithm has been implemented and also presents example results. After an analysis of the algorithm and suggestions for improvements, the most important conclusion of the report is presented: automatic detection of ditches is possible.
@mastersthesis{diva2:456702,
author = {Wasell, Richard},
title = {{Automatisk detektering av diken i LiDAR-data}},
school = {Linköping University},
type = {{LiTH-ISY-EX--11/4524--SE}},
year = {2011},
address = {Sweden},
}
Today, 3D models of cities are created from aerial images using a camera rig. Images, together with sensor data from the flights, are stored for further processing when building 3D models. However, there is a market demand for a more mobile solution of satisfactory quality. If the camera position can be calculated for each image, there is an existing algorithm available for the creation of 3D models.
This master thesis project aims to investigate whether the iPhone 4 offers image and sensor data of good enough quality from which 3D models can be created. Calculations of movements and rotations from sensor data form the foundation of the image processing and should refine the camera position estimates.
The 3D models are built from image processing only, since the sensor data cannot be used due to poor accuracy. Because of that, the scale of the 3D models is unknown, and a measurement of the real objects is needed to make scaling possible. Compared to a test algorithm that calculates 3D models from images only, already available in SBD's system, the quality of the 3D models in this master thesis project is, judged by the human eye, almost the same or in some respects even better.
@mastersthesis{diva2:452945,
author = {Lundqvist, Tobias},
title = {{3D mapping with iPhone}},
school = {Linköping University},
type = {{LiTH-ISY-EX--11/4517--SE}},
year = {2011},
address = {Sweden},
}
In this master thesis the possibility of detecting and tracking objects in multispectral infrared video sequences is investigated. The current method, using fixed-size rectangles, has significant disadvantages. These disadvantages are addressed by using image segmentation to estimate the shape of the object. The result of the image segmentation is used to determine the infrared contrast of the object. Our results show that some objects give very good segmentation, tracking and shape detection. The objects that perform best are the flares and countermeasures. In particular, helicopters seen from the side, with significant movement, are better detected with our method. The motion of the object is very important, since movement is the main component in successful shape detection; this is because helicopters are much colder than flares and engines. Detecting the presence and position of moving objects is easier and can be done quite successfully even for helicopters. Using structure tensors, we can also detect the presence and estimate the position of stationary objects.
@mastersthesis{diva2:415941,
author = {Möller, Sebastian},
title = {{Image Segmentation and Target Tracking using Computer Vision}},
school = {Linköping University},
type = {{LiTH-ISY-EX--11/4424--SE}},
year = {2011},
address = {Sweden},
}
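The abstract above mentions using structure tensors to detect stationary objects. Below is a minimal, generic structure-tensor sketch (the standard formulation, not the thesis code); the smoothing scale `sigma` is an assumed parameter.

```python
# Structure tensor: smoothed outer products of image gradients. The smaller
# eigenvalue is large where the local signal varies in two directions, which
# can indicate an object even when it is stationary.
import numpy as np
from scipy.ndimage import sobel, gaussian_filter

def structure_tensor_response(img, sigma=2.0):
    ix = sobel(img.astype(float), axis=1)
    iy = sobel(img.astype(float), axis=0)
    jxx = gaussian_filter(ix * ix, sigma)
    jxy = gaussian_filter(ix * iy, sigma)
    jyy = gaussian_filter(iy * iy, sigma)
    # Smaller eigenvalue of [[jxx, jxy], [jxy, jyy]] per pixel.
    tr, det = jxx + jyy, jxx * jyy - jxy * jxy
    return tr / 2 - np.sqrt(np.maximum(tr**2 / 4 - det, 0))
```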
A common computer vision task is navigation and mapping. Many indoor navigation tasks require depth knowledge of flat, unstructured surfaces (walls, floor, ceiling). With passive illumination only, this is an ill-posed problem. Inspired by small children using a torchlight, we use a spotlight for active illumination. Using our torchlight approach, depth and orientation estimation of unstructured, flat surfaces boils down to estimation of ellipse parameters. The extraction of ellipses is very robust and requires little computational effort.
@techreport{diva2:650756,
author = {Felsberg, Michael and Larsson, Fredrik and Wang, Han and Ynnerman, Anders and Schön, Thomas},
title = {{Torchlight Navigation}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2011},
type = {Other academic},
number = {LiTH-ISY-R, 3004},
address = {Sweden},
}
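Since the torchlight report above reduces depth and orientation estimation to ellipse parameter extraction, here is a minimal sketch of extracting those parameters with OpenCV; the threshold value is an assumption.

```python
# Threshold the spotlight, take the largest contour, and fit an ellipse.
# fitEllipse returns center, axis lengths and orientation, the parameters
# from which surface depth and orientation would then be derived.
import cv2

def spot_ellipse(gray):
    _, mask = cv2.threshold(gray, 200, 255, cv2.THRESH_BINARY)
    contours, _ = cv2.findContours(mask, cv2.RETR_EXTERNAL,
                                   cv2.CHAIN_APPROX_NONE)
    spot = max(contours, key=cv2.contourArea)   # needs at least 5 points
    (cx, cy), (major, minor), angle = cv2.fitEllipse(spot)
    return (cx, cy), (major, minor), angle
```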
Medical imaging is an important tool for diagnosis and treatment planning today. However as the demand for efficiency increases at the same time as the data volumes grow immensely, the need for computer assisted analysis, such as image segmentation, to help and guide the practitioner increases.
Medical image segmentation can be used for various tasks; the localization and delineation of pathologies such as cancer tumors is just one example. Numerous problems with noise and image artifacts in the generated images make segmentation a difficult task, and the developer is often forced to choose between speed and performance. In clinical practice, however, this is impossible, as both speed and performance are crucial. One solution to this problem might be to involve the user more in the segmentation, using interactive algorithms where the user can influence the segmentation for an improved result.
This thesis has concentrated on finding a fast and interactive segmentation method for liver tumor segmentation. Various methods were explored, and a few were chosen for implementation and further development. Two methods appeared to be the most promising: Bayesian Region Growing (BRG) and Level Set.
An interactive Level Set algorithm emerged as the best alternative for the interactive part, and could be used in combination with both BRG and Level Set. A new data term based on a probability model instead of image edges was also explored for the Level Set method, and proved more promising than the original one. The probability-based Level Set and the BRG method both provided good quality results, but the faster of the two was the BRG method, which could segment a tumor present in 25 CT image slices in less than 10 seconds when implemented in Matlab and mex-C++ code on an ACPI x64-based PC with two 2.4 GHz Intel(R) Core(TM)2 CPUs and 8 GB of RAM. The interactive Level Set could be successfully used as an interactive addition to the automatic method, but its usefulness was somewhat reduced by its slow processing time (~1.5 s/slice) and the relative complexity of the required user interactions.
@mastersthesis{diva2:438557,
author = {Thomasson, Viola},
title = {{Liver Tumor Segmentation Using Level Sets and Region Growing}},
school = {Linköping University},
type = {{LiTH-ISY-EX--11/4485--SE}},
year = {2011},
address = {Sweden},
}
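As context for the BRG method mentioned above, here is a minimal intensity-based region-growing sketch; the thesis uses a Bayesian acceptance criterion, so the simple absolute-difference test and its tolerance below are placeholder assumptions.

```python
# Grow a 3D region from a seed voxel, accepting 6-connected neighbors whose
# intensity is close to the seed's. Usage: mask = region_grow(ct_volume, seed).
import numpy as np
from collections import deque

def region_grow(vol, seed, tol=30.0):
    grown = np.zeros(vol.shape, bool)
    ref = float(vol[seed])
    queue = deque([seed])
    grown[seed] = True
    while queue:
        z, y, x = queue.popleft()
        for dz, dy, dx in [(1,0,0), (-1,0,0), (0,1,0),
                           (0,-1,0), (0,0,1), (0,0,-1)]:
            n = (z + dz, y + dy, x + dx)
            if all(0 <= n[i] < vol.shape[i] for i in range(3)) and not grown[n]:
                if abs(float(vol[n]) - ref) < tol:   # placeholder criterion
                    grown[n] = True
                    queue.append(n)
    return grown
```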
MRI (Magnetic Resonance Imaging) is a medical imaging method that uses magnetic fields in order to retrieve images of the human body. This thesis revolves around a novel acquisition method for 3D fMRI (functional Magnetic Resonance Imaging) called PRESTO-CAN, which uses a radial pattern to sample the (kx,kz)-plane of k-space (the frequency domain) and a Cartesian sample pattern in the ky-direction. The radial sample pattern allows for a denser sampling of the central parts of k-space, which contain the most basic frequency information about the structure of the recorded object. This allows a higher temporal resolution to be achieved compared with other sampling methods, since fewer total samples are needed to retrieve enough information about how the object has changed over time. Since fMRI is mainly used for monitoring blood flow in the brain, increased temporal resolution means that fast changes in brain activity can be tracked more efficiently.

The temporal resolution can be further improved by reducing the time needed for scanning, which in turn can be achieved by applying parallel imaging. One such parallel imaging method is SENSE (SENSitivity Encoding). The scan time is reduced by decreasing the sampling density, which causes aliasing in the recorded images. The SENSE method removes the aliasing by utilizing the extra information provided by the fact that multiple receiver coils with differing sensitivities are used during the acquisition. By measuring the sensitivities of the respective receiver coils and solving an equation system with the aliased images, it is possible to calculate how they would have looked without aliasing.

In this master thesis, SENSE has been successfully implemented in PRESTO-CAN. By using normalized convolution to refine the sensitivity maps of the receiver coils, images of satisfactory quality could be reconstructed when reducing the k-space sample rate by a factor of 2, and images of relatively good quality also when the sample rate was reduced by a factor of 4. In this way, this thesis has contributed to the improvement of the temporal resolution of the PRESTO-CAN method.
@mastersthesis{diva2:423964,
author = {Ahlman, Gustav},
title = {{Improved Temporal Resolution Using Parallel Imaging in Radial-Cartesian 3D functional MRI}},
school = {Linköping University},
type = {{LiTH-ISY-EX--11/4470--SE}},
year = {2011},
address = {Sweden},
}
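The SENSE unfolding described above amounts to solving a small linear system per aliased pixel. A worked toy example for reduction factor R = 2 with two coils (synthetic sensitivities; real sensitivity maps come from calibration, refined in the thesis by normalized convolution):

```python
# Each coil measures a sum of the two pixels that alias onto each other,
# weighted by that coil's sensitivity; least squares separates them again.
import numpy as np

S = np.array([[0.9, 0.2],    # coil 1 sensitivity at positions A and B
              [0.3, 0.8]])   # coil 2 sensitivity at positions A and B
true_pixels = np.array([1.0, 0.5])
aliased = S @ true_pixels                        # what the two coils measure
unfolded, *_ = np.linalg.lstsq(S, aliased, rcond=None)
print(unfolded)                                  # recovers [1.0, 0.5]
```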
In this thesis, an investigation was performed to find ways of distinguishing between fires and vehicles at waste stations, in the hope of removing vehicles as a source of error during early fire detection. The existing system makes use of a heat camera, which rotates over 48 different angles (also known as zones) from a fixed position. If the heat is above a certain value within a zone, the system sounds the fire alarm.

The rotation of the camera results in an unwanted displacement between two successive frames within the same zone. By use of image registration, this displacement was removed. After the registration of an image, segmentation was performed, where cold objects are eliminated as an error source. Lastly, an analysis was performed on the warm objects.

In the end, it was shown that the image registration was a successful improvement of the existing system. It was also shown that vehicles can, to some extent, be eliminated as an error source.
@mastersthesis{diva2:446792,
author = {Söderström, Rikard},
title = {{An early fire detection system through registration and analysis of waste station IR-images}},
school = {Linköping University},
type = {{LiTH-ISY-EX--11/4354--SE}},
year = {2011},
address = {Sweden},
}
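The abstract above does not restate which registration method was used; as one standard way to remove a pure inter-frame translation, here is a hypothetical phase-correlation sketch with OpenCV (not necessarily the thesis's approach):

```python
# Estimate the sub-pixel translation between two IR frames of the same zone
# and warp the current frame back onto the previous one before differencing.
import cv2
import numpy as np

def register_translation(prev_ir, curr_ir):
    a, b = np.float32(prev_ir), np.float32(curr_ir)
    (dx, dy), _ = cv2.phaseCorrelate(a, b)        # detected shift between frames
    m = np.float32([[1, 0, -dx], [0, 1, -dy]])    # undo the detected shift
    return cv2.warpAffine(b, m, (b.shape[1], b.shape[0]))
```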
This report introduces some simplifications to the method by Fitzgibbon et al. that allows for 3D model construction from turn-table sequences. It is assumed that the reader has previously read the work by Fitzgibbon et al. in order to fully understand this report.
Fitzgibbon et al. present a method for 3D model construction that utilizes the extra constraints imposed by turn-table sequences. Restricting the scenario to a turn-table sequence with a single camera with fixed settings produces these extra constraints:
C1. The internal parameters for the camera are the same for all images
C2. The motion of the camera can be described by a rotation around a single axis
It is shown that in the uncalibrated case the number of parameters to estimate is m + 8 where m is the number of images.
We further simplify the problem by using the extra constraints given by the fact that we know:
C3. The internal parameters of the camera, i.e. the K matrix
C4. That the angle between each pair of consecutive cameras is the same
Using these extra simplifications makes it possible to create a 3D model from realistic data without using Bundle Adjustment. A sketch of the resulting camera parameterization follows the reference entry below.
@techreport{diva2:434353,
author = {Larsson, Fredrik},
title = {{Automatic 3D Model Construction for Turn-Table Sequences - A Simplification}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2011},
type = {Other academic},
number = {LiTH-ISY-R, 3022},
address = {Sweden},
}
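Under constraints C1-C4 above, the i-th camera can be parameterized as P_i = K [R(i*theta) | t], with one shared K, one shared t and a fixed rotation step around a single axis. A small sketch with assumed example values for K, theta and t:

```python
# Build the camera matrices for a 36-image turn-table sequence with a known
# K, equal angular steps, and rotation about the vertical (y) axis.
import numpy as np

def rot_y(a):
    c, s = np.cos(a), np.sin(a)
    return np.array([[c, 0, s], [0, 1, 0], [-s, 0, c]])

K = np.array([[800, 0, 320], [0, 800, 240], [0, 0, 1.0]])
t = np.array([0, 0, 5.0])          # camera 5 units from the rotation axis
theta = 2 * np.pi / 36             # 36 images around the turn-table

cameras = [K @ np.hstack([rot_y(i * theta), t.reshape(3, 1)])
           for i in range(36)]
x = cameras[0] @ np.array([0.1, 0.2, 0.3, 1.0])   # project a 3D point
print(x[:2] / x[2])                                # pixel coordinates
```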
Monitoring wear particles in lubricating oils allows specialists to evaluate the health and functionality of a mechanical system. The main analysis techniques available today are manual particle analysis and automatic optical analysis. Manual particle analysis is effective and reliable since the analyst continuously sees what is being counted. The drawback is that the technique is quite time demanding and dependent on the skills of the analyst. Automatic optical particle counting constitutes a closed system that does not allow the objects counted to be observed in real time. This has resulted in a number of sources of error for the instrument.

In this thesis a new method for counting particles based on light microscopy with image analysis is proposed. It has proven to be a fast and effective method that eliminates the sources of error of the previously described methods. The new method correlates very well with manual analysis, which is used as a reference method throughout this study. Size estimation of particles and detection of metallic particles has also been shown to be possible with the current image analysis setup. With more advanced software and analysis instrumentation, the image analysis method could be further developed into a decision-based machine allowing for declarations about which wear mode is occurring in a mechanical system.
@mastersthesis{diva2:420518,
author = {Ceco, Ema},
title = {{Image Analysis in the Field of Oil Contamination Monitoring}},
school = {Linköping University},
type = {{LITH-ISY-EX--11/4467--SE}},
year = {2011},
address = {Sweden},
}
Most mobile video-recording devices of today, e.g. cell phones and music players, make use of a rolling shutter camera. A rolling shutter camera captures video by recording every frame line-by-line from top to bottom of the image, leading to image distortions in situations where either the device or the target is moving. Recording video by hand also leads to visible frame-to-frame jitter.
In this thesis, methods to decrease distortion caused by the motion of a video-recording device with a rolling shutter camera are presented. The methods are based on estimating the orientation of the camera from gyroscope and accelerometer measurements.
The algorithms are implemented on the iPod Touch 4, and the resulting videos are compared to those of competing stabilization software, both commercial and free, in a series of blind experiments. The results from this user study show that the methods presented in the thesis perform equal to or better than the others.
@mastersthesis{diva2:420914,
author = {Hanning, Gustav},
title = {{Video Stabilization and Rolling Shutter Correction using Inertial Measurement Sensors}},
school = {Linköping University},
type = {{LiTH-ISY-EX--11/4464--SE}},
year = {2011},
address = {Sweden},
}
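As a much simplified illustration of the idea in the thesis above, the sketch below rectifies a rolling-shutter frame for pure yaw by giving each image row its own rotation homography H(y) = K R(theta(y)) K^-1; the constant yaw rate, readout time and K are assumed example inputs, whereas the thesis estimates the full 3D orientation from gyroscope and accelerometer measurements.

```python
# Each row is captured at a slightly different time, so each row gets its own
# rotation compensation. This toy version assumes a constant yaw rate.
import cv2
import numpy as np

def rectify_yaw(img, K, yaw_rate, readout_time):
    h, w = img.shape[:2]
    k_inv = np.linalg.inv(K)
    map_x = np.zeros((h, w), np.float32)
    map_y = np.zeros((h, w), np.float32)
    cols = np.arange(w)
    for y in range(h):
        a = yaw_rate * readout_time * (y / h - 0.5)  # yaw when row y was read
        c, s = np.cos(a), np.sin(a)
        r = np.array([[c, 0, s], [0, 1, 0], [-s, 0, c]])
        hom = K @ r @ k_inv          # maps rectified pixels to recorded ones
        p = hom @ np.stack([cols, np.full(w, y), np.ones(w)])
        map_x[y], map_y[y] = p[0] / p[2], p[1] / p[2]
    return cv2.remap(img, map_x, map_y, cv2.INTER_LINEAR)
```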
In this master thesis, a model-based video coding algorithm has been developed that uses input from a colour and depth camera, such as the Microsoft Kinect. Using a model-based representation of a video has several advantages over the commonly used block-based approach, used by the H.264 standard. For example, videos can be rendered in 3D, be viewed from alternative views, and have objects inserted into them for augmented reality and user interaction.
This master thesis demonstrates a very efficient way of encoding the geometry of a scene. The results of the proposed algorithm show that it can reach very low bitrates with comparable results to the H.264 standard.
@mastersthesis{diva2:420400,
author = {Sandberg, David},
title = {{Model-Based Video Coding Using a Colour and Depth Camera}},
school = {Linköping University},
type = {{LiTH-ISY-EX--11/4463--SE}},
year = {2011},
address = {Sweden},
}
3D cameras delivering height data can be used for quality inspection of goods on a conveyor.
It is then of interest to distinguish the important parts of the image from background and noise and further to divide these interesting parts into segments that have a strong correlation to objects on the conveyor belt.
Segmentation can easily be done by thresholding in the simple case. However, in more complex situations, for example when objects touch or overlap, this does not work well.
In this thesis, research and evaluation of a few different methods for segmentation of height image data are presented. The focus is to find an accurate method for segmentation of smooth irregularly shaped organic objects such as vegetables or shellfish.
For evaluative purposes a database consisting of height images depicting a variety of such organic objects has been collected.
We show in the thesis that a conventional gradient magnitude method is hard to beat in the general case. If, however, the objects to be segmented are heavily non-convex, with a lot of crests and valleys within themselves, one could be better off choosing a normalized least squares method.
@mastersthesis{diva2:393236,
author = {Schöndell, Andreas},
title = {{Evaluation of methods for segmentation of 3D range image data}},
school = {Linköping University},
type = {{LiTH-ISY-EX--11/4346--SE}},
year = {2011},
address = {Sweden},
}
The thesis presents an investigation of the potential of measuring plant condition from hyperspectral reflectance data. To do this, some linear methods for embedding the high dimensional hyperspectral data and performing regression to a plant condition space have been compared. A preprocessing step that aims at normalizing illumination intensity in the hyperspectral images has been conducted, and some different methods for this purpose have also been compared.

A large scale experiment has been conducted where tobacco plants have been grown and treated differently with respect to watering and nutrition. The treatment of the plants has served as ground truth for the plant condition. Four sets of plants have been grown one week apart and the plants have been measured at different ages up to the age of about five weeks.

The thesis concludes that there is a relationship between plant treatment and the spectral reflectance of the leaves, but the treatment has to be somewhat extreme to enable a useful treatment approximation from the spectrum. CCA is the proposed method for calculating the hyperspectral basis used to embed the hyperspectral data into the plant condition (treatment) space. A preprocessing method that uses a weighted normalization of the spectra for illumination intensity normalization is concluded to be the most powerful of the compared methods.
@mastersthesis{diva2:350907,
author = {Johansson, Peter},
title = {{Plant Condition Measurement from Spectral Reflectance Data}},
school = {Linköping University},
type = {{LiTH-ISY-EX--10/4369--SE}},
year = {2010},
address = {Sweden},
}
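A minimal sketch of the proposed CCA embedding, using scikit-learn and random placeholder data; in the thesis, X would be illumination-normalized reflectance spectra and Y the watering/nutrition treatment levels.

```python
# Fit CCA between spectra and treatment, embed the spectra into the
# treatment space, and predict treatment from a spectrum.
import numpy as np
from sklearn.cross_decomposition import CCA

rng = np.random.default_rng(0)
X = rng.normal(size=(120, 200))    # 120 plants, 200 spectral bands
Y = rng.normal(size=(120, 2))      # water and nutrition treatment levels

cca = CCA(n_components=2)
cca.fit(X, Y)
X_c, Y_c = cca.transform(X, Y)     # spectra embedded in treatment space
Y_pred = cca.predict(X)            # treatment approximation from spectrum
```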
This thesis treats topics within the area of object recognition. A real-time view matching method has been developed to compute the transformation between two different images of the same scene. This method uses a color based region detector called MSCR and affine transformations of these regions to create affine-invariant patches that are used as input to the SIFT algorithm. A parallel method to compute the SIFT descriptor has been created with relaxed constraints so that the descriptor size and the number of histogram bins can be adjusted. Additionally, a matching step to deduce correspondences and a parallel RANSAC method have been created to estimate the undergone transformation between these descriptors. To achieve real-time performance, the implementation has been targeted to use the parallel nature of the GPU with CUDA as the programming language. Focus has been put on the architecture of the GPU to find the best way to parallelize the different processing steps. CUDA has also been combined with OpenGL to be able to use the hardware accelerated anisotropic sampling method for affine transformations of regions. Parts of the implementation can also be used individually from either Matlab or by using the provided C++ library directly. The method was also evaluated in terms of accuracy and speed. It was shown that our algorithm has similar or better accuracy at finding correspondences than SIFT when the 3D geometry changes are large but we get a slightly worse result on images with flat surfaces.
@mastersthesis{diva2:345932,
author = {Lind, Anders},
title = {{High-speed View Matching using Region Descriptors}},
school = {Linköping University},
type = {{LiTH-ISY-EX--10/4356--SE}},
year = {2010},
address = {Sweden},
}
This thesis describes the development of a robotic platform for evaluation of gaze stabilization algorithms, built for the Sensorimotor Systems Laboratory at the University of British Columbia. The primary focus of the work was to measure the performance of a biomimetic vestibulo-ocular reflex controller for gaze stabilization using cerebellar feedback. A flexible robotic system was designed and built in order to run reproducible test sequences at high speeds, featuring three dimensional linear movement and rotation around the vertical axis. On top of the robot head, a 1 DOF camera head can be independently controlled by a stabilization algorithm implemented in Simulink. Vestibular input is provided by a 3-axis accelerometer and a 3-axis gyroscope. The video feed from the camera head is fed into a workstation computer running a custom image processing program which evaluates both the absolute and relative movement of the images in the sequence. The absolute angles of tracked regions in the image are continuously returned, as well as the movement of the image sequence across the sensor in full 3 DOF camera rotation. Due to dynamic downsampling and noise suppression algorithms, very good performance was reached, enabling retinal slip estimation at 720 degrees per second. Two different controllers were implemented, one adaptive open loop controller similar to the work of Dean et al. [12] and one reference implementation using closed loop control and optimal linear estimation of reference angles. A sequence of tests was run in order to evaluate the performance of the two algorithms. The adaptive controller was shown to offer superior performance, dramatically reducing the movement of the image for all test sequences, and it improved further as it was tuned over time.
@mastersthesis{diva2:359452,
author = {Landgren, Axel},
title = {{A robotic camera platform for evaluation of biomimetic gaze stabilization using adaptive cerebellar feedback}},
school = {Linköping University},
type = {{LiTH-ISY-EX--10/4351--SE}},
year = {2010},
address = {Sweden},
}
Man portable air defence systems, MANPADS, pose a big threat to civilian and military aircraft. This thesis aims to find methods that could be used in a missile approach warning system based on infrared cameras.
The two main tasks of the completed system are to classify the type of missile, and also to estimate its position and velocity from a sequence of images.
The classification is based on hidden Markov models, one-class classifiers, and multi-class classifiers.
Position and velocity estimation uses a model of the observed intensity as a function of real intensity, image coordinates, distance and missile orientation. The estimation is made by an extended Kalman filter.
We show that fast classification of missiles based on radiometric data and a hidden Markov model is possible and works well, although more data would be needed to verify the results.
Estimating the position and velocity works fairly well if the initial parameters are known. Unfortunately, some of these parameters cannot be computed using the available sensor data.
@mastersthesis{diva2:323455,
author = {Holm Ovr\'{e}n, Hannes and Emilsson, Erika},
title = {{Missile approach warning using multi-spectral imagery}},
school = {Linköping University},
type = {{LiTH-ISY-EX--10/4329--SE}},
year = {2010},
address = {Sweden},
}
We will present the basic theory for the camera geometry. Our goal is camera calibration and the tools necessary for this. We start with homogeneous matrices that can be used to describe geometric transformations in a simple manner. Then we consider the pinhole camera model, the simplified camera model that we will show how to calibrate.
A camera matrix describes the mapping from the 3D world to a camera image. The camera matrix can be determined through a number of corresponding points measured in the world and the image. We also demonstrate the common special case of camera calibration when it can be assumed that the world is flat. Then, a plane in the world is transformed to the image plane. Such a plane-to-plane mapping is called a homography.
Finally, we discuss some useful mathematical tools needed for camera calibration. We show that the solution we present for the determination of the camera matrix is equivalent to a least-squares solution. We also show how to solve a homogeneous system of equations using SVD (singular value decomposition).
@techreport{diva2:693117,
author = {Magnusson, Maria},
title = {{Short on camera geometry and camera calibration}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2010},
type = {Other academic},
number = {LiTH-ISY-R, 3070},
address = {Sweden},
}
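The SVD recipe summarized in the report above, shown for homography estimation from four point correspondences (the standard DLT): stack two equations per correspondence into A and take the right singular vector with the smallest singular value as the solution of Ah = 0 with ||h|| = 1.

```python
# Homogeneous least squares via SVD: the minimizer of ||Ah|| subject to
# ||h|| = 1 is the last row of V^T.
import numpy as np

def homography_dlt(src, dst):
    rows = []
    for (x, y), (u, v) in zip(src, dst):
        rows.append([-x, -y, -1, 0, 0, 0, u * x, u * y, u])
        rows.append([0, 0, 0, -x, -y, -1, v * x, v * y, v])
    _, _, vt = np.linalg.svd(np.array(rows, float))
    return vt[-1].reshape(3, 3)

src = [(0, 0), (1, 0), (1, 1), (0, 1)]
dst = [(2, 1), (4, 1), (4, 3), (2, 3)]       # scale by 2, translate by (2, 1)
H = homography_dlt(src, dst)
p = H @ np.array([1.0, 1.0, 1.0])
print(p[:2] / p[2])                          # -> [4, 3]
```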
This master thesis investigates the difficulties of constructing a depth map using one low resolution grayscale camera mounted in the front of a car. The goal is to produce a depth map in real-time to assist other algorithms in the safety system of a car. This has been shown to be difficult using the evaluated combination of camera position and choice of algorithms.
The main problem is to estimate an accurate optical flow. Another problem is to handle moving objects. The conclusion is that the implementations, mainly triangulation of corresponding points tracked using a Lucas Kanade tracker, provide information of too poor quality to be useful for the safety system of a car.
@mastersthesis{diva2:355971,
author = {Svensson, Fredrik},
title = {{Structure from Forward Motion}},
school = {Linköping University},
type = {{LiTH-ISY-EX--10/4364--SE}},
year = {2010},
address = {Sweden},
}
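A minimal sketch of the point-tracking step named in the abstract above, using OpenCV's pyramidal Lucas-Kanade tracker; the corner-detector parameters are assumptions, and the tracked pairs would subsequently be triangulated against the egomotion estimate.

```python
# Detect corners in the previous frame and track them into the current frame.
import cv2
import numpy as np

def track_points(prev_gray, curr_gray):
    pts = cv2.goodFeaturesToTrack(prev_gray, maxCorners=500,
                                  qualityLevel=0.01, minDistance=7)
    nxt, status, _ = cv2.calcOpticalFlowPyrLK(prev_gray, curr_gray, pts, None)
    good = status.ravel() == 1
    return pts[good].reshape(-1, 2), nxt[good].reshape(-1, 2)
```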
In this thesis, an algorithm for producing saliency maps and an algorithm for detecting salient regions based on the saliency map were developed. The saliency values are computed as center-surround differences, and a local descriptor called the region p-channel is used to represent the center and surround respectively. An integral image representation called the integral p-channel is used to speed up extraction of the local descriptor for any given image region. The center-surround difference is calculated as either histogram or p-channel dissimilarities.
Ground truth was collected using human subjects and the algorithm’s ability to detect salient regions was evaluated against this ground truth. The algorithm was also compared to another saliency algorithm.
Two different center-surround interpretations are tested, as well as several p-channel and histogram dissimilarity measures. The results show that for all tested settings the best performing dissimilarity measure is the so called diffusion distance. The performance comparison showed that the algorithm developed in this thesis outperforms the algorithm against which it was compared, both with respect to region detection and saliency ranking of regions. It can be concluded that the algorithm shows promising results and further investigation of the algorithm is recommended. A list of suggested approaches for further research is provided.
@mastersthesis{diva2:291472,
author = {Tuttle, Alexander},
title = {{Saliency Maps using Channel Representations}},
school = {Linköping University},
type = {{LITH-ISY-EX--10/4169--SE}},
year = {2010},
address = {Sweden},
}
Foreground segmentation is a common first step in tracking and surveillance applications. The purpose of foreground segmentation is to provide later stages of image processing with an indication of where interesting data can be found. This thesis is an investigation of how foreground segmentation can be performed in two contexts: as a pre-step to trajectory tracking and as a pre-step in indoor surveillance applications.
Three methods are selected and detailed: a single Gaussian method, a Gaussian mixture model method, and a codebook method. Experiments are then performed on typical input video using the methods. It is concluded that the Gaussian mixture model produces the output which yields the best trajectories when used as input to the trajectory tracker. An extension is proposed to the Gaussian mixture model which reduces shadow, improving the performance of foreground segmentation in the surveillance context.
@mastersthesis{diva2:285807,
author = {Molin, Joel},
title = {{Foreground Segmentation of Moving Objects}},
school = {Linköping University},
type = {{LiTH-ISY-EX--10/4299--SE}},
year = {2010},
address = {Sweden},
}
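A Gaussian mixture background model of the kind recommended above is available off the shelf in OpenCV; the sketch below also enables MOG2's shadow marking, related in spirit to the shadow-reducing extension the thesis proposes. The input file name is a placeholder.

```python
# Per-pixel Gaussian mixture background subtraction with shadow marking.
import cv2

subtractor = cv2.createBackgroundSubtractorMOG2(history=500,
                                                varThreshold=16,
                                                detectShadows=True)
cap = cv2.VideoCapture("surveillance.avi")   # placeholder input file
while True:
    ok, frame = cap.read()
    if not ok:
        break
    mask = subtractor.apply(frame)           # 255 foreground, 127 shadow, 0 bg
cap.release()
```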
Most people are familiar with the BRIO labyrinth game and the challenge of guiding the ball through the maze. The goal of this project was to use this game to create a platform for evaluation of control algorithms. The platform was used to evaluate a few different controlling algorithms, both traditional automatic control algorithms as well as algorithms based on online incremental learning.
The game was fitted with servo actuators for tilting the maze. A camera, together with computer vision algorithms, was used to estimate the state of the game. The evaluated controlling algorithm had the task of calculating a proper control signal, given the estimated state of the game.
The evaluated learning systems used traditional control algorithms to provide initial training data. After initial training, the systems learned from their own actions and after a while they outperformed the controller used to provide initial training.
@mastersthesis{diva2:322572,
author = {Öfjäll, Kristoffer},
title = {{LEAP, A Platform for Evaluation of Control Algorithms}},
school = {Linköping University},
type = {{LiTH-ISY-EX--10/4370--SE}},
year = {2010},
address = {Sweden},
}
In this work we present a region detector, an adaptation to range data of the popular Maximally Stable Extremal Regions (MSER) region detector. We call this new detector Maximally Robust Range Regions (MRRR). We apply the new detector to real range data captured by a commercially available laser range camera. Using this data we evaluate the repeatability of the new detector and compare it to some other recently published detectors. The presented detector shows a repeatability which is better than or equal to that of the best of the other detectors. The MRRR detector also offers additional data on the detected regions. The additional data could be crucial in applications such as registration or recognition.
@techreport{diva2:325006,
author = {Viksten, Fredrik and Forss\'{e}n, Per-Erik},
title = {{Maximally Robust Range Regions}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2010},
type = {Other academic},
number = {LiTH-ISY-R, 2961},
address = {Sweden},
}
This document is an addendum to the main text in A local geometry-based descriptor for 3D data applied to object pose estimation by Fredrik Viksten and Klas Nordberg. This addendum gives proofs for propositions stated in the main document. It also details how to extract information from the fourth order tensor referred to as S22 in the main document.
@techreport{diva2:325000,
author = {Nordberg, Klas and Viksten, Fredrik},
title = {{A local geometry based descriptor for 3D data:
Addendum on rank and segment extraction}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2010},
type = {Other academic},
number = {LiTH-ISY-R, 2951},
address = {Sweden},
}
Within this thesis an algorithm for object recognition called Cluster Matching has been developed, implemented and evaluated. The image information is sampled at arbitrary sample points, instead of interest points, and local image features are extracted. These sample points are used as a compact representation of the image data and can quickly be searched for previously known objects. The algorithm is evaluated on a test set of images and the result is surprisingly reliable and time efficient.
@mastersthesis{diva2:284633,
author = {Lennartsson, Mattias},
title = {{Object Recognition with Cluster Matching}},
school = {Linköping University},
type = {{LITH-ISY-EX--09/4152--SE}},
year = {2009},
address = {Sweden},
}
This thesis is about improving the image quality of image sequences scanned by the film scanner GoldenEye. Film grain is often seen as an artistic effect in film sequences, but scanned images can be more grainy or noisy than intended. To remove the grain and noise, as well as sharpen the images, a few known image enhancement methods have been implemented, tested and evaluated. An original thresholding method using the dyadic wavelet transform has also been tested. MATLAB has been used as the benchmark environment, but one method has also been implemented in C/C++. Some of the methods work satisfactorily when it comes to the image result, but none of them works satisfactorily when it comes to time consumption. To address this, a few speed-up ideas are suggested at the end of the thesis. A method to correct the color of the sequences has also been suggested.
@mastersthesis{diva2:210478,
author = {Stuhr, Lina},
title = {{Grain Reduction in Scanned Image Sequences under Time Constraints}},
school = {Linköping University},
type = {{LiTH-ISY-EX--09/4203--SE}},
year = {2009},
address = {Sweden},
}
Gaze tracking is the estimation of the point in space a person is “looking at”. This is widely used in both diagnostic and interactive applications, such as visual attention studies and human-computer interaction. The most common commercial solution used to track gaze today uses a combination of infrared illumination and one or more cameras. These commercial solutions are reliable and accurate, but often expensive. The aim of this thesis is to construct a simple single-camera gaze tracker from off-the-shelf components. The method used for gaze tracking is based on infrared illumination and a schematic model of the human eye. Based on images of reflections of specific light sources in the surfaces of the eye the user’s gaze point will be estimated. Evaluation is also performed on both the software and hardware components separately, and on the system as a whole. Accuracy is measured in spatial and angular deviation and the result is an average accuracy of approximately one degree on synthetic data and 0.24 to 1.5 degrees on real images at a range of 600 mm.
@mastersthesis{diva2:209626,
author = {Wallenberg, Marcus},
title = {{A Single-Camera Gaze Tracker using Controlled Infrared Illumination}},
school = {Linköping University},
type = {{LITH-ISY-EX--09/4199--SE}},
year = {2009},
address = {Sweden},
}
Time of flight (ToF) is an imaging technique that uses depth information to capture 3D information in a scene. Recent developments in the technology have made ToF cameras more widely available and practical to work with. The cameras now enable real time 3D imaging and positioning in a compact unit, making the technology suitable for a variety of object recognition tasks.
An object recognition system for locating teats is at the center of the DeLaval VMS, which is a fully automated system for milking cows. By implementing ToF technology as part of the visual detection procedure, it would be possible to locate and track all four teats' positions in real time and potentially provide an improvement compared with the current system.
The developed algorithm for teat detection is able to locate teat shaped objects in scenes and extract information about their position, width and orientation. These parameters are determined with an accuracy of millimeters. The algorithm also shows promising results when tested on real cows. Although it detects many false positives, the algorithm was able to correctly detect 171 out of 232 visible teats in a test set of real cow images. This result is a satisfying proof of concept and shows the potential of ToF technology in the field of automated milking.
@mastersthesis{diva2:224321,
author = {Westberg, Michael},
title = {{Time of Flight Based Teat Detection}},
school = {Linköping University},
type = {{LiTH-ISY-EX--09/4154 --SE}},
year = {2009},
address = {Sweden},
}
This Master Thesis has been conducted at the National Laboratory of Forensic Science (SKL) in Linköping. When images to be analyzed at SKL that depict an interesting object are of bad quality, there may be a need to enhance them. If several images of the object are available, the total amount of information can be used to estimate one single enhanced image. A program to do this has been developed by studying methods for image registration and high resolution image estimation. Tests of important parts of the procedure have been conducted. The final results are satisfying, and the key to a good high resolution image seems to be the precision of the image registration. Improvements to this part may lead to even better results. More suggestions for further improvements have also been proposed.
@mastersthesis{diva2:390,
author = {Karelid, Mikael},
title = {{Image Enhancement over a Sequence of Images}},
school = {Linköping University},
type = {{LiTH-ISY-EX--08/4013--SE}},
year = {2008},
address = {Sweden},
}
This report gives an overview of, and motivates the design of, a C++ framework for object recognition using channel-coded feature maps. The code was produced in connection with the work on my PhD thesis Channel-Coded Feature Maps for Object Recognition and Machine Learning. The package contains algorithms ranging from basic image processing routines to specific complex algorithms for creating channel-coded feature maps through piecewise polynomials. Much emphasis has been put into creating a flexible framework using virtual interfaces. This makes it easy, e.g., to switch between different image primitive detectors or learning methods in an object recognizer. Some common design choices include an image class with convenient but fast pixel access, a configurable assert macro for error handling, and a common base class for object ownership management. The main computer vision algorithms are channel-coded feature maps (CCFMs) including their derivatives, single-sided colored lines, object detection using an abstract hypothesize-verify framework, and tracking and pose estimation using locally weighted regression and CCFMs. The code is considered as having alpha status at best. It is available under the GNU General Public License (GPL) and is mainly intended for future research on the subject.
@techreport{diva2:288558,
author = {Jonsson, Erik},
title = {{Object Recognition using Channel-Coded Feature Maps: C++ Implementation Documentation}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2008},
type = {Other academic},
number = {LiTH-ISY-R, 2838},
address = {Sweden},
}
Aircraft navigation commonly relies on an Inertial Navigation System and a Global Navigational Satellite System (GNSS). In navigational warfare the GNSS can be jammed, and therefore a third navigational system is needed. The system tried in this thesis is camera based navigation: the position is determined from a video camera and a sensor reference. This thesis treats the matching between the sensor reference and the video image.
Two methods have been implemented: normalized cross correlation and position determination through a homography. Normalized cross correlation creates a correlation matrix. The other method uses point correspondences between the images to determine a homography between them, and obtains a position through the homography. The more point correspondences there are, the better the position determination will be.
The results have been quite good. The methods have found the right position when the Euler angles of the UAV have been known. Normalized cross correlation has been the best of the tested methods.
@mastersthesis{diva2:128466,
author = {Olgemar, Markus},
title = {{Camera Based Navigation:
Matching between Sensor reference and Video image}},
school = {Linköping University},
type = {{LITH-ISY-EX--08/4170--SE}},
year = {2008},
address = {Sweden},
}
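A minimal sketch of the first matching method above: normalized cross correlation of a video frame against a sensor reference image via OpenCV's template matcher. The file names are placeholders, and the real system must also compensate for the UAV's Euler angles.

```python
# Slide the video frame over the reference; the peak of the correlation
# matrix gives the estimated position of the frame in the reference.
import cv2

reference = cv2.imread("sensor_reference.png", cv2.IMREAD_GRAYSCALE)
video_frame = cv2.imread("video_frame.png", cv2.IMREAD_GRAYSCALE)

scores = cv2.matchTemplate(reference, video_frame, cv2.TM_CCOEFF_NORMED)
_, best, _, top_left = cv2.minMaxLoc(scores)   # peak of the correlation matrix
print(top_left, best)
```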
There has been rapid progress on the graphics processor in recent years, largely because of the demands from computer games on speed and image quality. Because of the graphics processor's special architecture, it is much faster at solving parallel problems than the normal processor. Due to its increasing programmability, it is possible to use it for other tasks than those it was originally designed for.
Even though graphics processors have been programmable for some time, it has been quite difficult to learn how to use them. CUDA enables the programmer to use C-code, with a few extensions, to program NVIDIA’s graphics processor and completely skip the traditional programming models. This thesis investigates if the graphics processor can be used for calculations without knowledge of how the hardware mechanisms work. An image processing algorithm calculating the optical flow has been implemented. The result shows that it is rather easy to implement programs using CUDA, but some knowledge of how the graphics processor works is required to achieve high performance.
@mastersthesis{diva2:127132,
author = {Ringaby, Erik},
title = {{Optical Flow Computation on Compute Unified Device Architecture}},
school = {Linköping University},
type = {{LiTH-ISY-EX--08/4043--SE}},
year = {2008},
address = {Sweden},
}
The purpose of this master thesis was to study computer vision algorithms for vehicle detection in monochrome images captured by a mono camera. The work has mainly been focused on detecting rear-view cars in daylight conditions. Previous work in the literature has been reviewed, and algorithms based on edges, shadows and motion as vehicle cues have been modified, implemented and evaluated. This work presents a combination of a multiscale edge based detection and a shadow based detection as the most promising algorithm, with a positive detection rate of 96.4% on vehicles at a distance of between 5 m and 30 m. For the algorithm to work in a complete system for vehicle detection, future work should be focused on developing a vehicle classifier to reject false detections.
@mastersthesis{diva2:18234,
author = {Lundagårds, Marcus},
title = {{Vehicle Detection in Monochrome Images}},
school = {Linköping University},
type = {{LiTH-ISY-EX--08/4148--SE}},
year = {2008},
address = {Sweden},
}
In this thesis it is examined whether the pose of an object can be determined by a system trained with a synthetic 3D model of said object. A number of variations of methods using P-channel representation are examined. Reference images are rendered from the 3D model, features, such as gradient orientation and color information are extracted and encoded into P-channels. The P-channel representation is then used to estimate an overlapping channel representation, using B1-spline functions, to estimate a density function for the feature set. Experiments were conducted with this representation as well as the raw P-channel representation in conjunction with a number of distance measures and estimation methods.
It is shown that, with correct preprocessing and choice of parameters, the pose can be detected with some accuracy and, if not in real-time, fast enough to be useful in a tracker initialization scenario. It is also concluded that the success rate of the estimation depends heavily on the nature of the object.
@mastersthesis{diva2:17521,
author = {Berg, Martin},
title = {{Pose Recognition for Tracker Initialization Using 3D Models}},
school = {Linköping University},
type = {{LiTH-ISY-EX--07/4076--SE}},
year = {2008},
address = {Sweden},
}
The PRESTO sequence is a well-known 3-D fMRI imaging sequence. In this sequence the echo planar imaging technique is merged with the echo-shift technique. This combination results in a very fast image acquisition, which is required for fMRI examinations of neural activation in the human brain. The aim of this work was to use the basic Cartesian PRESTO sequence as a framework when developing a novel trajectory using a non-Cartesian grid.
Our new pulse sequence, PRESTO CAN, rotates the k-space profiles around the ky-axis in a non-Cartesian manner. This results in a high sampling density close to the centre of k-space, and at the same time it provides sparser data collection in the part of k-space that contains less useful information. This "can- or cylinder-like" pattern is expected to result in a much faster k-space acquisition without losing important spatial information.
A new reconstruction algorithm was also developed. The purpose was to be able to construct an image volume from data obtained using the novel PRESTO CAN sequence. This reconstruction algorithm was based on the gridding technique, and a Kaiser-Bessel window was also used in order to re-sample the data onto a Cartesian grid. This was required to make 3-D Fourier transformation possible. In addition, simulations were also performed in order to verify the function of the reconstruction algorithm. Furthermore, in vitro tests showed that the development of the PRESTO CAN sequence and the corresponding reconstruction algorithm were highly successful.
In the future, the results can relatively easily be extended and generalized for in vivo investigations. In addition, there are numerous exciting possibilities for extending the basic techniques described in this thesis.
@mastersthesis{diva2:397232,
author = {Thyr, Per},
title = {{Method for Acquisition and Reconstruction of non-Cartesian 3-D fMRI}},
school = {Linköping University},
type = {{LITH-ISY-EX--08/4058--SE}},
year = {2008},
address = {Sweden},
}
In this thesis, two real-time stereo methods have been implemented and evaluated. The first is based on block matching and the second on local phase. The goal was to run the algorithms in real time and examine which one is best. The block matching method performed better than the phase based method, both in speed and accuracy. SIMD operations (Single Instruction Multiple Data) have been used in the processor, giving a speed boost by a factor of two.
@mastersthesis{diva2:16992,
author = {Arvidsson, Lars},
title = {{Stereoseende i realtid}},
school = {Linköping University},
type = {{LITH-ISY-EX--07/3944--SE}},
year = {2007},
address = {Sweden},
}
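A minimal sketch of the block-matching approach (the faster and more accurate of the two methods above) using OpenCV's StereoBM on a rectified image pair; the file names and parameter values are assumptions.

```python
# Dense disparity by block matching on a rectified stereo pair. StereoBM
# returns fixed-point disparities with 4 fractional bits, hence the /16.
import cv2

left = cv2.imread("left.png", cv2.IMREAD_GRAYSCALE)
right = cv2.imread("right.png", cv2.IMREAD_GRAYSCALE)

stereo = cv2.StereoBM_create(numDisparities=64, blockSize=15)
disparity = stereo.compute(left, right).astype("float32") / 16.0
```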
In this thesis spacetime analysis is applied to laser triangulation in an attempt to eliminate certain artifacts caused mainly by reflectance variations of the surface being measured. It is shown that spacetime analysis does eliminate these artifacts almost completely. It is also shown that, thanks to the spacetime analysis, the shape of the laser beam used is no longer critical, and that in some cases the laser could probably even be exchanged for a non-coherent light source. Furthermore, experiments running the derived algorithm on a GPU (Graphics Processing Unit) are conducted, with very promising results.
The thesis starts by deriving the theory needed for doing spacetime analysis in a laser triangulation setup, taking perspective distortions into account; then several experiments evaluating the method are conducted.
@mastersthesis{diva2:17262,
author = {Benderius, Björn},
title = {{Laser Triangulation Using Spacetime Analysis}},
school = {Linköping University},
type = {{LITH-ISY-EX--07/4047--SE}},
year = {2007},
address = {Sweden},
}
Today, tool center point calibration is mostly done by a manual procedure. The method is very time consuming and the result may vary due to how skilled the operators are.
This thesis proposes a new automated iterative method for tool center point calibration of industrial robots, by making use of computer vision and image processing techniques. The new method has several advantages over the manual calibration method. Experimental verifications have shown that the proposed method is much faster, still delivering a comparable or even better accuracy. The setup of the proposed method is very easy, only one USB camera connected to a laptop computer is needed and no contact with the robot tool is necessary during the calibration procedure.
The method can be split into three parts. Initially, the transformation between the robot wrist and the tool is determined by solving a closed loop of homogeneous transformations. Second, an image segmentation procedure is described for finding point correspondences on a rotation symmetric robot tool. The image segmentation part is necessary for performing a measurement with six degrees of freedom of the camera-to-tool transformation. The last part of the proposed method is an iterative procedure which automates an ordinary four point tool center point calibration algorithm. The iterative procedure ensures that the accuracy of the tool center point calibration only depends on the accuracy of the camera when registering a movement between two positions.
@mastersthesis{diva2:23964,
author = {Hallenberg, Johan},
title = {{Robot Tool Center Point Calibration using Computer Vision}},
school = {Linköping University},
type = {{LiTH-ISY-EX-- 07/3943--SE}},
year = {2007},
address = {Sweden},
}
A common problem when using background models to segment moving objects from video sequences is that the shadows cast by objects usually differ significantly from the background and therefore get detected as foreground. This causes several problems when extracting and labeling objects, such as object shape distortion and several objects merging together. The purpose of this thesis is to explore various possibilities to handle this problem.
Three methods for statistical background modeling are reviewed. All methods work on a per pixel basis, the first is based on approximating the median, the next on using Gaussian mixture models, and the last one is based on channel representation. It is concluded that all methods detect cast shadows as foreground.
A study of existing methods to handle cast shadows has been carried out in order to gain knowledge on the subject and get ideas. A common approach is to transform the RGB-color representation into a representation that separates color into intensity and chromatic components in order to determine whether or not newly sampled pixel-values are related to the background. The color spaces HSV, IHSL, CIELAB, YCbCr, and a color model proposed in the literature (Horprasert et al.) are discussed and compared for the purpose of shadow detection. It is concluded that Horprasert's color model is the most suitable for this purpose.
The thesis ends with a proposal of a method to combine background modeling using Gaussian mixture models with shadow detection using Horprasert's color model. It is concluded that, while not perfect, such a combination can be very helpful in segmenting objects and detecting their cast shadow.
@mastersthesis{diva2:23393,
author = {Wood, John},
title = {{Statistical Background Models with Shadow Detection for Video Based Tracking}},
school = {Linköping University},
type = {{LITH-ISY-EX--07/3921--SE}},
year = {2007},
address = {Sweden},
}
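As a rough illustration of the intensity/chromaticity separation discussed above, the sketch below classifies a pixel as shadow when it keeps the background's chromaticity but is darker; this is a deliberately simplified stand-in, not Horprasert's actual model, and all thresholds are assumptions.

```python
# Shadow keeps chromaticity but loses brightness; foreground changes both.
import numpy as np

def classify(pixel, background, low=0.4, chroma_tol=0.04):
    pixel, background = np.float64(pixel), np.float64(background)
    brightness = pixel.sum() / max(background.sum(), 1e-9)
    chroma_dist = np.abs(pixel / max(pixel.sum(), 1e-9)
                         - background / max(background.sum(), 1e-9)).sum()
    if chroma_dist < chroma_tol:
        if abs(brightness - 1.0) < 0.1:
            return "background"
        if low <= brightness < 1.0:
            return "shadow"
    return "foreground"

print(classify([60, 45, 30], [120, 90, 60]))   # -> "shadow" (half brightness)
```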
The objective of this thesis is to investigate if it is possible to use stereo vision to find and track the players and the ball during a football game.
The thesis shows that it is possible to detect all players that aren't too occluded by another player. Situations where a player is occluded by another player are handled by tracking the players from frame to frame.
The ball is also detected in most frames by looking for ball-like features. As with the players, the ball is tracked from frame to frame, so that when the ball is occluded its position is estimated by the tracker.
@mastersthesis{diva2:23152,
author = {Borg, Johan},
title = {{Detecting and Tracking Players in Football Using Stereo Vision}},
school = {Linköping University},
type = {{LiTH-ISY-EX--07/3535--SE}},
year = {2007},
address = {Sweden},
}
One major goal of the COSPAL project is to develop an artificial cognitive system architecture with the capability of exploratory learning. Exploratory learning is a strategy that makes it possible to apply generalization on a conceptual level, resulting in an extension of competences. Whereas classical learning methods aim at the best possible generalization, i.e., concluding from a number of samples of a problem class to the problem class itself, exploration aims at applying acquired competences to a new problem class. Incremental or online learning is an inherent requirement for performing exploratory learning.
Exploratory learning requires new theoretical tools and new algorithms. In the COSPAL project, we mainly investigate reinforcement-type learning methods for exploratory learning, and in this paper we focus on its algorithmic aspect. Learning is performed in terms of four nested loops, where the outermost loop reflects the user-reinforcement-feedback loop, the intermediate two loops switch between different solution modes at the symbolic and sub-symbolic levels respectively, and the innermost loop performs the acquired competences in terms of perception-action cycles. We present a system diagram which explains this process in more detail.
We discuss the learning strategy in terms of learning scenarios provided by the user. This interaction between user (’teacher’) and system is a major difference to most existing systems where the system designer places his world model into the system. We believe that this is the key to extendable robust system behavior and successful interaction of humans and artificial cognitive systems.
We furthermore address the issue of bootstrapping the system, and, in particular, the visual recognition module. We give some more in-depth details about our recognition method and how feedback from higher levels is implemented. The described system is, however, work in progress and no final results are available yet. The preliminary results that we have achieved so far clearly point towards a successful proof of the architecture concept.
@techreport{diva2:302803,
author = {Felsberg, Michael and Wiklund, Johan and Jonsson, Erik and Moe, Anders and Granlund, Gösta},
title = {{Exploratory Learning Structure in Artificial Cognitive Systems}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2006},
type = {Other academic},
number = {LiTH-ISY-R, 2738},
address = {Sweden},
}
Fluoroscopy is the term for continuous X-ray imaging of a patient. Since the patient, and also the physician, is then exposed to continuous X-ray radiation, the radiation dose must be kept low, which leads to noisy images. It is therefore desirable to improve the images through image processing. The image enhancement must, however, run in real time, so conventional methods cannot be used.
This thesis investigates how orthogonal derivative operators can be used to improve the readability of fluoroscopy images by means of noise suppression and edge enhancement. Derivative operators are separable, which makes them extremely computation-friendly and easy to incorporate into a scale pyramid. The scale pyramid makes it possible to process structures and details of different sizes separately, while the downsampling mechanism ensures that this decomposition does not noticeably increase the computational burden. In the complete solution, structure/noise separation is also introduced to prevent amplification of, and to suppress contributions from, the frequency bands where a pixel is dominated by noise.
The results show that noise can indeed be suppressed while edges and lines are well preserved, or enhanced if so desired. The oriented filtering does, however, easily give rise to worm-like structures in the noise, but this can be avoided with the right parameter settings for the structure/noise separation. The balance between oriented and non-oriented filtering is likewise controllable via a parameter that can be optimized with respect to the needs and wishes of each application.
@mastersthesis{diva2:21733,
author = {Brolund, Hans},
title = {{Förbättring av fluoroskopibilder}},
school = {Linköping University},
type = {{LITH-ISY-EX-06/3823-SE}},
year = {2006},
address = {Sweden},
}
The objective of this master thesis was to study the performance of an active triangulation system for 3-D imaging in underwater applications. Structured light from a 20 mW laser and a conventional video camera were used to collect data for generation of 3-D images. Different techniques to locate the laser line and transform it into spatial coordinates were developed and evaluated. A field trial and a laboratory trial were performed.
From the trials we can conclude that the distance resolution is much higher than the lateral and longitudinal resolutions. The lateral resolution can be improved either by using a high frame rate camera or simply by using a low scanning speed. It is possible to obtain a range resolution of less than a millimeter. The maximum range of vision was 5 meters under water, measured on a white target, and 3 meters for a black target in clear sea water. These results are, however, dependent on environmental and system parameters such as laser power, laser beam divergence and water turbidity. A higher laser power would, for example, increase the maximum range.
@mastersthesis{diva2:21659,
author = {Norström, Christer},
title = {{Underwater 3-D imaging with laser triangulation}},
school = {Linköping University},
type = {{LiTH-ISY-EX--06/3851--SE}},
year = {2006},
address = {Sweden},
}
The increased usage of infrared sensors by pilots has created a growing demand for simulated environments based on infrared radiation. This has led to an increased need for Saab to refine their existing model for simulating real-time infrared imagery, which resulted in this thesis being carried out. Saab develops the Gripen aircraft, and they provide training simulators where pilots can train in a realistic environment. The new model is required to be based on the real-world behavior of infrared radiation and, unlike Saab's existing model, to have dynamically changeable attributes.
This thesis seeks to develop a simulation model compliant with the requirements presented by Saab, and to develop the implementation of a test environment demonstrating the features and capabilities of the proposed model. All through the development of the model, the pilot training value has been kept in mind.
The first part of the thesis consists of a literature study to build a theoretical base for the rest of the work. This is followed by the development of the simulation model itself and a subsequent implementation thereof. The simulation model and the test implementation are evaluated as the final step conducted within the framework of this thesis.
The main conclusions of this thesis first of all includes that the proposed simulation model does in fact have its foundation in physics. It is further concluded that certain attributes of the model, such as time of day, are dynamically changeable as requested. Furthermore, the test implementation is considered to have been feasibly integrated with the current simulation environment.
A plan concluding how to proceed has also been developed. The plan suggests future work with the proposed simulation model, since the evaluation shows that it performs well in comparison to the existing model as well as other products on the market.
@mastersthesis{diva2:22896,
author = {Dehlin, Jonas and Löf, Joakim},
title = {{Dynamic Infrared Simulation:
A Feasibility Study of a Physically Based Infrared Simulation Model}},
school = {Linköping University},
type = {{LITH-ISY-EX--06/3815--SE}},
year = {2006},
address = {Sweden},
}
This report describes a method to detect and recognize objects from 3D laser radar data. The method is based on local descriptors computed from triplets of planes that are estimated from the data set. Each descriptor that is computed on query data is compared with descriptors computed on object model data to get a hypothesis of object class and pose. A hypothesis is either verified or rejected using a similarity measure between the model data set and the query data set.
@techreport{diva2:257173,
author = {Johansson, Björn and Moe, Anders},
title = {{Object Recognition in 3D Laser Radar Data using Plane triplets}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2005},
type = {Other academic},
number = {LiTH-ISY-R, 2708},
address = {Sweden},
}
This thesis aims to investigate the usefulness of Independent Component Analysis (ICA) for noise reduction in images taken by infrared cameras. The focus is on reducing additive noise, which is divided into two parts: Gaussian noise and sensor-specific fixed pattern noise. To reduce the Gaussian noise, a popular ICA-based method called sparse code shrinkage is used. A new method, also based on ICA, is developed to reduce the pattern noise. In the new method, an analysis of image data is performed for each sensor to manually identify typical pattern noise components. These components are then used to reduce the pattern noise in images taken by that sensor. The methods are shown to give good results on infrared images. The algorithms are tested on both synthetic and real images, and the results are presented and compared with other algorithms.
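A minimal sketch of the sparse code shrinkage idea (illustrative only: the threshold choice, the patch handling and the thesis's pattern-noise method are not reproduced here), using scikit-learn's FastICA on image patches:

    import numpy as np
    from sklearn.decomposition import FastICA

    def sparse_code_shrinkage(patches, noise_sigma, n_components=32):
        # Learn an ICA basis in which natural image patches are sparse
        ica = FastICA(n_components=n_components, whiten="unit-variance")
        s = ica.fit_transform(patches)   # one row of components per patch
        t = np.sqrt(2.0) * noise_sigma   # illustrative threshold choice
        s = np.sign(s) * np.maximum(np.abs(s) - t, 0.0)  # soft shrinkage
        return ica.inverse_transform(s)  # back to the pixel domain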
@mastersthesis{diva2:20831,
author = {Björling, Robin},
title = {{Denoising of Infrared Images Using Independent Component Analysis}},
school = {Linköping University},
type = {{LiTH-ISY-EX--05/3726--SE}},
year = {2005},
address = {Sweden},
}
This report develops a method for probabilistic conceptual sensor modeling. The idea is to generate probabilities for detection, recognition and identification based on a few simple factors. The focus lies on FLIR sensors and thermal radiation, although other wavelength bands are also discussed. The model can be used as a whole, or one or several parts can be used to create a simpler model. The core of the model is based on the Johnson criteria, which use resolution as the input parameter. Some extensions that model other factors are also implemented. Finally, the possibility of using this model for sensors other than FLIR is briefly discussed.
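For concreteness, the commonly quoted empirical form of the Johnson criteria (the target transfer probability function) can be sketched as follows; the N50 cycle requirements below are standard literature values and not necessarily those used in the thesis:

    import numpy as np

    # Cycles on target commonly quoted for 50 % task performance
    N50 = {"detection": 1.0, "recognition": 4.0, "identification": 6.4}

    def johnson_probability(n_cycles, task="detection"):
        # Target transfer probability function
        r = n_cycles / N50[task]
        e = 2.7 + 0.7 * r
        return r**e / (1.0 + r**e)

    def cycles_on_target(critical_dim_m, range_m, sensor_cyc_per_mrad):
        # Resolvable cycles across the target's critical dimension
        return sensor_cyc_per_mrad * (critical_dim_m / range_m) * 1e3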
@mastersthesis{diva2:20633,
author = {Sonesson, Mattias},
title = {{A Probabilistic Approach to Conceptual Sensor Modeling}},
school = {Linköping University},
type = {{LITH-ISY-EX-3428-2004}},
year = {2005},
address = {Sweden},
}
This thesis describes new methods for automatic crack detection in pavements. Cracks in pavements can serve as an early indication that repair is needed.
Automatic crack detection is preferable to manual inventory: the repeatability is better, the inventory can be done at higher speed, and it does not interrupt traffic.
The automatic and semi-automatic crack detection systems that exist today use Image Analysis methods. Powerful methods are now also available in the area of Computer Vision. These methods work in higher dimensions with greater complexity and generate measures of local signal properties, whereas the Image Analysis methods for crack detection use morphological operations on binary images.
Methods for digitizing video data on VHS cassettes and stitching images from nearby frames have been developed.
Four methods for crack detection have been evaluated, and two of them have been used to form a crack detection and classification program implemented in Matlab.
One image set was used during the implementation and another image set was used for validation. The crack detection system performed correct detection in 99.2 percent of cases when analysing the images used during implementation. The results on the validation data were considerably worse. When the program is used on pavements other than the one used during implementation, information about the surface texture is required to calibrate the crack detection.
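As an illustration of the morphological, binary-image style of crack detection that the thesis contrasts with Computer Vision methods, a minimal black-hat detector could look as follows (assumed parameters; not the thesis implementation):

    import numpy as np
    from scipy import ndimage as ndi

    def detect_cracks(gray, size=15, k=3.0):
        # Cracks are thin dark structures: a grey-level closing removes
        # them, so the difference (a "black-hat") highlights them
        g = gray.astype(float)
        blackhat = ndi.grey_closing(g, size=(size, size)) - g
        mask = blackhat > blackhat.mean() + k * blackhat.std()
        # A small binary opening suppresses isolated noise pixels
        return ndi.binary_opening(mask, structure=np.ones((2, 2), bool))

The surface-texture calibration mentioned above corresponds to the threshold parameters, which are texture-dependent.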
@mastersthesis{diva2:20160,
author = {Håkansson, Staffan},
title = {{Detektering av sprickor i vägytor med hjälp av Datorseende}},
school = {Linköping University},
type = {{LITH-ISY-EX--05/3699--SE}},
year = {2005},
address = {Sweden},
}
This report addresses the problem of software correction of spatially variant blur in digital images. The problem arises when the camera optics contains flaws, when the scene contains multiple moving objects with different relative motion, or when the camera itself is, for example, rotated. Compensation through deconvolution is impossible due to the shift variance of the PSF; hence alternative methods are required. A number of methods have been published. This report evaluates two of them.
@mastersthesis{diva2:20290,
author = {Andersson, Mathias},
title = {{Image processing algorithms for compensation of spatially variant blur}},
school = {Linköping University},
type = {{LITH-ISY-EX--05/3633--SE}},
year = {2005},
address = {Sweden},
}
@techreport{diva2:257175,
author = {Forss\'{e}n, Per-Erik and Johansson, Björn and Granlund, Gösta},
title = {{Learning under Perceptual Aliasing}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2005},
type = {Other academic},
number = {},
address = {Sweden},
}
This report introduces a robust contour descriptor for view-based object recognition. In recent years great progress has been made in the field of view-based object recognition, mainly due to the introduction of texture-based features such as SIFT and MSER. Although these are remarkably successful for textured objects, they have problems with man-made objects with little or no texture. For such objects, either explicit geometrical models, or contour and shading based features, are also needed. This report introduces a robust contour descriptor which we hope can be combined with texture-based features to obtain object recognition systems that work in a wider range of situations. Each detected contour is described as a sequence of line and ellipse segments, both of which have well-defined geometrical transformations to other views. The feature detector is also quite fast, mainly because chains of contour points are detected first; these chains are then split into line segments, which are subsequently either kept, grouped into ellipses, or discarded. We demonstrate the robustness of the feature detector with a repeatability test under general homography transformations of a planar scene. Through the repeatability test, we find that using ellipse segments instead of lines, where appropriate, improves repeatability. We also apply the features in a robotic setting where object appearances are learned by manipulating the objects.
@techreport{diva2:288582,
author = {Forssen, Per-Erik and Moe, Anders},
title = {{Contour Descriptors for View-Based Object Recognition}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2005},
type = {Other academic},
number = {LiTH-ISY-R, 2706},
address = {Sweden},
}
@techreport{diva2:262476,
author = {Jonsson, Erik and Felsberg, Michael and Granlund, Gösta},
title = {{Incremental Associative Learning}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2005},
type = {Other academic},
number = {LiTH-ISY-R, 2691},
address = {Sweden},
}
The MATLAB/C program take version 3.1 is a program for simulation of X-ray projections from 3D volume data. It is based on an older C version by Muller-Merbach as well as an extended C version by Turbell. The program can simulate 2D X-ray projections from 3D objects. These data can then be input to 3D reconstruction algorithms. Here, however, we only demonstrate a couple of 2D reconstruction algorithms, written in MATLAB. Simple MATLAB examples show how to generate the take projections followed by subsequent reconstruction. Compared to the old take version, the C code has been carefully revised. A preliminary, rather untested feature of using a polychromatic X-ray source with different energy levels was already included in the old take version. The current polychromatic X-ray feature is, however, carefully tested. For example, it has been compared with the results from the program described by Malusek et al. We also demonstrate experiments with a polychromatic X-ray source and a Plexiglass object giving the beam-hardening artefact. Detector sensitivity for different energy levels is not included in take. However, in the section on the real experiment, we describe a technique to include the detector sensitivity in the energy spectrum. Finally, an experiment comparing real and simulated data was performed. The result was not completely successful, but we still demonstrate it.
Contemporary analytical reconstruction methods for helical cone-beam CT have to be designed to handle the Long Object Problem. Normally, a moderate amount of over-scanning is sufficient for reconstruction of a certain region of interest (ROI). Unfortunately, for iterative methods, it seems that the useful ROI diminishes with every iteration step. The remedies proposed here are twofold. Firstly, we use careful extrapolation and masking of projection data. Secondly, we generate and utilize projection data from incompletely reconstructed volume parts, which is rather counter-intuitive and contradictory to our initial assumptions. The results seem very encouraging. Even voxels close to the boundary of the original ROI are enhanced by the iterative loop as well as those in the middle.
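The beam-hardening effect demonstrated with the Plexiglass object can be sketched in a few lines (illustrative names; take's actual spectra and geometry are not reproduced): with a polychromatic source, the effective attenuation -log(I/I0) grows sublinearly with path length because low-energy photons are absorbed preferentially.

    import numpy as np

    def polychromatic_projection(path_len_cm, spectrum, mu_per_cm):
        # spectrum[i]:  relative photon count in energy bin i
        # mu_per_cm[i]: linear attenuation coefficient in bin i [1/cm]
        L = np.atleast_1d(np.asarray(path_len_cm, dtype=float))
        I = (spectrum * np.exp(-np.outer(L, mu_per_cm))).sum(axis=1)
        return -np.log(I / spectrum.sum())  # sublinear in L: beam hardening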
@techreport{diva2:288581,
author = {Seger, Olle and Seger, Maria Magnusson},
title = {{The MATLAB/C program take - a program for simulation of X-ray projections from 3D volume data. Demonstration of beam-hardening artefacts in subsequent CT reconstruction.}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2005},
type = {Other academic},
number = {LiTH-ISY-R, 2682},
address = {Sweden},
}
To summarize, the VISATEC project was initiated to combine the specific scientific competencies of the research groups at CAU and LiU, together with the industrial view on vision applications, in order to develop novel, more robust algorithms for object localization and recognition. This goal was achieved by a two-fold strategy, whereby on the one hand more robust basic algorithms were developed and on the other hand a method for the combination of these algorithms was devised. In particular, the latter confirmed the consortium's belief that an appropriate combination of a number of basic algorithms leads to more robust results than any single method could achieve.
However, the multi-cue integration is just one algorithm of many that were developed in the VISATEC project. All developed algorithms are described in some detail in the remainder of this report. An overview of the respective publications can be found in the appendix.
Despite some difficulties that were encountered along the way, we as a consortium feel that the VISATEC project was a success. That this is not only our own opinion is reflected in the outcome of the final review. We believe that the work done during the three years of the project not only furthered our understanding of the matter, but also added to the knowledge within the scientific community and showed new possibilities for industrial vision applications.
@techreport{diva2:288604,
author = {Sommer, Gerald and Granlund, Gösta and Granert, Oliver and Krause, Martin and Nordberg, Klas and Perwass, Christian and Söderberg, Robert and Viksten, Fredrik and Chavarria, Marco},
title = {{Information Society Technologies (IST) programme:
Final Report}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2005},
type = {Other academic},
number = {},
address = {Sweden},
}
To improve the control of a steel casting process, ABB has developed an Electro Magnetic Brake (EMBR). This product is designed to improve steel quality, i.e. to reduce non-metallic inclusions and blisters as well as the risk of surface cracks. There is a demand for increased steel quality, and simulations and experiments play an important role in optimizing the casting process. An advanced CFD simulation model has been created to carry out this task.
The simulation model is validated on a water model that has been built for this purpose. The water model also makes experiments possible. One step in this validation is to measure the velocity and motion pattern of the seeding particles and the air bubbles in the water model, to see if they correspond to the simulation results.
Since the water is transparent, seeding particles have been added to the liquid in order to observe the motion of the water. Because the particles have the same density as water, they follow the flow accurately. The motion of the air bubbles added to the water model must also be observed, since the bubbles influence the flow pattern.
An algorithm, ”Transparent motions”, is thoroughly inspected and implemented. ”Transparent motions” was originally designed to post-process X-ray images. In this thesis, it is investigated whether the algorithm is also applicable to the water model and to the image sequences containing seeding particles and air bubbles that are to be used for motion estimation.
The results are satisfactory for image sequences containing particles only; a camera with a higher sampling rate would improve them further. For image sequences with both bubbles and particles, no useful results were achieved.
@mastersthesis{diva2:21306,
author = {Gustafsson, Gabriella},
title = {{Multiphase Motion Estimation in a Two Phase Flow}},
school = {Linköping University},
type = {{LITH-ISY-EX--05/3723--SE}},
year = {2005},
address = {Sweden},
}
This thesis describes and evaluates a number of approaches and algorithms for nonuniformity correction (NUC) and suppression of fixed pattern noise in an image sequence. The main task of this thesis work was to create a general NUC for infrared focal plane arrays. To create a radiometrically correct NUC, reference-based methods using polynomial approximation are used instead of the more common scene-based methods, which produce a cosmetic NUC.
The pixels that cannot be adjusted to give a correct value for the incoming radiation are defined as dead. Four separate methods of identifying dead pixels are used to find these pixels. Both the scene sequence and calibration data are used in these identification methods.
The algorithms and methods have all been tested on real image sequences. A graphical user interface using the presented algorithms has been created in Matlab to simplify the correction of image sequences. A conversion of the corrected image values to radiance and temperature has also been implemented.
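A minimal sketch of the reference-based, per-pixel polynomial correction (assumed data layout; the dead-pixel handling and the radiance/temperature conversion are omitted):

    import numpy as np

    def fit_nuc(calib, radiances, order=2):
        # calib: (K, H, W) raw responses at K known reference radiances;
        # fit a per-pixel polynomial mapping raw counts to radiance
        K, H, W = calib.shape
        x = calib.reshape(K, -1)
        coeffs = [np.polyfit(x[:, p], radiances, order) for p in range(H * W)]
        return np.asarray(coeffs).reshape(H, W, order + 1)

    def apply_nuc(frame, coeffs):
        flat = frame.ravel()
        c = coeffs.reshape(-1, coeffs.shape[-1])
        out = [np.polyval(c[p], flat[p]) for p in range(flat.size)]
        return np.asarray(out).reshape(frame.shape)

Pixels whose calibration responses cannot be fitted with a small residual are natural candidates for the "dead" label.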
@mastersthesis{diva2:21133,
author = {Isoz, Wilhelm},
title = {{Calibration of Multispectral Sensors}},
school = {Linköping University},
type = {{LiTH-ISY-EX--05/3651--SE}},
year = {2005},
address = {Sweden},
}
This master's thesis investigates distance estimation using image processing and stereo vision for a known camera setup.
A large number of computational methods for obtaining the distance to objects exist today, but their performance has barely been measured. This work mainly looks at different block-based methods for distance estimation and examines the possibilities and limitations of applying established knowledge in image processing and stereo vision to distance estimation. The work was carried out at Bofors Defence AB in Karlskoga, Sweden, and is ultimately intended for use in an optical sensor system. The thesis examines proven
The results indicate that it is difficult to determine a complete near mask, i.e. the distance to every visible object, but the tested methods should still be usable point-wise for computing distances. The best method is based on computing the minimum absolute error and keeping only the most certain values.
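A minimal block-matching sketch in the spirit of the methods evaluated (hypothetical camera parameters; a rectified stereo pair and in-bounds indices are assumed): the disparity minimising the sum of absolute differences gives the range through z = f * b / d, and the SAD value itself can serve as the certainty score used to keep only the most reliable points.

    import numpy as np

    def point_range_sad(left, right, row, col, bsize=8, max_disp=64,
                        f_px=800.0, baseline_m=0.3):
        h = bsize // 2
        ref = left[row - h:row + h, col - h:col + h].astype(float)
        sads = [np.abs(ref - right[row - h:row + h,
                                   col - d - h:col - d + h]).sum()
                for d in range(1, max_disp)]
        d = 1 + int(np.argmin(sads))             # best disparity [pixels]
        return f_px * baseline_m / d, min(sads)  # range and certainty score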
@mastersthesis{diva2:20786,
author = {Hedlund, Gunnar},
title = {{Närmaskbestämning från stereoseende}},
school = {Linköping University},
type = {{LiTH-ISY-EX--05/3623--SE}},
year = {2005},
address = {Sweden},
}
The purpose of this master's thesis, performed at FOI, was to evaluate a range gated underwater camera for the application of identifying bottom objects. The master's thesis was supported by FMV within the framework of “arbetsorder Systemstöd minjakt (Jan Andersson, KC Vapen)”. The central part has been field trials, which were performed in both turbid and clear water. Conclusions about the performance of the camera system have been drawn, based on resolution and contrast measurements during the field trials. Laboratory testing has also been done to measure system-specific parameters, such as the effective gate profile and camera gate distances.
The field trials show that images can be acquired at significantly longer distances with the tested gated camera than with a conventional video camera. The distance at which the target can be detected is increased by a factor of 2. For images suitable for mine identification, the increase is about 1.3. However, studies of the performance of other range gated systems show that the increase in range for mine identification can be about 1.6. Gated viewing has also been compared to other technical solutions for underwater imaging.
@mastersthesis{diva2:20570,
author = {Andersson, Adam},
title = {{Range Gated Viewing with Underwater Camera}},
school = {Linköping University},
type = {{LITH-ISY-EX--05/3718--SE}},
year = {2005},
address = {Sweden},
}
Just how far is it possible to make the learning of new parts for recognition and robot picking autonomous? This thesis initially gives the prerequisites for the steps in learning and calibration that are to be automated. Among these tasks are selecting a suitable part model from numerous candidates with the help of a new part segmenter, and computing the spatial extent of this part to facilitate robotic collision handling. Other tasks are to analyze the part model in order to highlight correct and suitable edge segments for increasing pattern matching certainty, and to choose appropriate acceptance levels for pattern matching. Further tasks deal with simplifying camera calibration by analyzing the calibration pattern, and with compensating for differences in perspective at great depth variations by calculating the centre of perspective of the image. The image processing algorithms created to solve these tasks are described and evaluated thoroughly. This thesis shows that simplifying the steps of learning and calibration with the help of advanced image processing really is possible.
@mastersthesis{diva2:19024,
author = {Wernersson, Björn and Södergren, Mikael},
title = {{Automatiserad inlärning av detaljer för igenkänning och robotplockning}},
school = {Linköping University},
type = {{LiTH-ISY-EX--05/3755--SE}},
year = {2005},
address = {Sweden},
}
Contemporary algorithms employed for reconstruction of 3D volumes from helical cone beam projections are so-called non-exact algorithms. This means that the reconstructed volumes contain artifacts irrespective of the detector resolution and the number of projection angles employed in the process.
It has been proposed that these artifacts can be suppressed using an iterative scheme which comprises computation of projections from the already reconstructed volume as well as the non-exact reconstruction itself.
The purpose of the present work is to examine whether the iterative scheme can be applied to the non-exact reconstruction method PI-original in order to improve the reconstruction result. An important part of this implementation is a careful design of the projection operator, as a poorly designed projection operator may result in aliasing and/or other artifacts in the reconstruction result. Since the projection data is truncated, special care must be taken along the boundaries of the detector. Three different ways of handling this interpolation problem are proposed and examined.
The results show that artifacts caused by the PI-original method can indeed be reduced by the iterative scheme. However, each iteration requires at least three times more processing time than the initial reconstruction, which may call for compromises, clever optimizations and/or parallelization in the innermost loops. Furthermore, at higher cone angles certain types of artifacts seem to grow with each iteration instead of being suppressed.
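The iterative scheme itself is compact; below is a sketch with the reconstruction and projection operators as callables (in the thesis, PI-original and the carefully designed projection operator take these roles):

    def iterative_enhancement(proj, reconstruct, project, n_iter=3):
        vol = reconstruct(proj)              # non-exact first guess
        for _ in range(n_iter):
            residual = proj - project(vol)   # inconsistency in projection space
            vol = vol + reconstruct(residual)
        return vol

Each pass costs one forward projection plus one reconstruction, consistent with the reported processing time per iteration.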
@mastersthesis{diva2:19912,
author = {Sunnegårdh, Johan},
title = {{Iterative Enhancement of Non-Exact Reconstruction in Cone Beam CT}},
school = {Linköping University},
type = {{LITH-ISY-EX--04/3646--SE}},
year = {2004},
address = {Sweden},
}
This report describes and evaluates a number of algorithms for multi-sensor data fusion of radar and IR/TV data at the raw data level. Raw data fusion means that the fusion takes place before attribute or object extraction. Attribute extraction may discard information that could have improved the fusion. If the fusion is performed at the raw data level, more information is available, which could lead to improved attribute extraction in a later step. Two approaches are presented. The first method projects the radar image into the IR view and vice versa; the fusion is then performed on the resulting pairs of images with equal dimensions. The second method fuses the two original images into a volume, spanned by the three dimensions represented in the original images. This method is also extended by exploiting stereo vision. The results show that exploiting stereo vision can be worthwhile, since the extra information facilitates the fusion and yields a more general solution to the problem.
@mastersthesis{diva2:19523,
author = {Schultz, Johan},
title = {{Sensordatafusion av IR- och radarbilder}},
school = {Linköping University},
type = {{}},
year = {2004},
address = {Sweden},
}
This report brings together a novel approach to some computer vision problems and a particular algorithmic development of the Landweber iterative algorithm. The algorithm solves a class of high-dimensional, sparse, and constrained least-squares problems, which arise in various computer vision learning tasks, such as object recognition and object pose estimation. The algorithm has recently been applied to these problems, but it has been used rather heuristically. In this report we describe the method and put it on firm mathematical ground. We consider a convexly constrained weighted least-squares problem and propose for its solution a projected Landweber method which employs oblique projections onto the closed convex constraint set. We formulate the problem, present the algorithm and work out its convergence properties, including a rate-of-convergence result. The results are put in perspective of currently available projected Landweber methods. The application to supervised learning is described, and the method is evaluated in a function approximation experiment.
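The core iteration studied in the report can be sketched as follows (a plain orthogonal projection stands in for the oblique one, and the step length lam is a free parameter subject to the usual convergence bound):

    import numpy as np

    def projected_landweber(A, W, b, project_C, lam, n_iter=100):
        # x_{k+1} = P_C( x_k + lam * A^T W (b - A x_k) )
        x = np.zeros(A.shape[1])
        for _ in range(n_iter):
            x = project_C(x + lam * A.T @ (W @ (b - A @ x)))
        return x

    # Example: nonnegativity-constrained weighted least squares
    # x_hat = projected_landweber(A, np.eye(len(b)), b,
    #                             lambda v: np.maximum(v, 0.0), lam=1e-3)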
@techreport{diva2:244368,
author = {Johansson, Björn and Elfving, Tommy and Kozlov, Vladimir and Censor, Yair and Granlund, Gösta},
title = {{The Application of an Oblique-Projected Landweber Method to a Model of Supervised Learning}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2004},
type = {Other academic},
number = {LiTH-ISY-R, 2623},
address = {Sweden},
}
This report describes how blob features can be used for automatic estimation of the fundamental matrix from two perspective projections of a 3D scene. Blobs are perceptually salient, homogeneous, compact image regions. They are represented by their average colour, area, centre of gravity and inertia matrix. Coarse blob correspondences are found by voting using colour and local similarity transform matching on blob pairs. We then do RANSAC sampling of the coarse correspondences, and weight each estimate according to how well the approximating conics and colours of two blobs correspond. The initial voting significantly reduces the number of RANSAC samples required, and the extra information besides position, allows us to reject false matches more accurately than in RANSAC using point features.
@techreport{diva2:288340,
author = {Forssen, Per-Erik and Moe, Anders},
title = {{Automatic Estimation of Epipolar Geometry from Blob Features}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2004},
type = {Other academic},
number = {LiTH-ISY-R, 2620},
address = {Sweden},
}
This report describes a fourth order tensor defined on projective spaces which can be used for the representation of medium-level features, e.g., one or more oriented segments. The tensor has one part which describes what type of local structures are present in a region, and one part which describes where they are located. This information can be used, e.g., to represent multiple orientations, corners, and line-endings. The tensor can be defined for arbitrary signal dimension, but the presentation focuses on the properties of the fourth order tensor for the case of 2D and 3D image data. A method for estimating the proposed tensor representation by means of simple computations directly from the structure tensor is presented. Given a simple matrix representation of the tensor, it can be shown that there is a direct correspondence between the number of oriented segments and the rank of the matrix, provided that the number of segments is three or less. The report also presents techniques for extracting information about the oriented segments which the tensor represents. Finally, it is shown that a small set of coefficients which are invariant to changes of the coordinate system can be computed from the proposed tensor.
@techreport{diva2:288343,
author = {Nordberg, Klas},
title = {{A fourth order tensor for representation of orientation and position of oriented segments}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2004},
type = {Other academic},
number = {LiTH-ISY-R, 2587},
address = {Sweden},
}
In this paper we present a new and efficient method to implement robust smoothing of low-level signal features: B-spline channel smoothing. This method consists of three steps: encoding of the signal features into channels, averaging of the channels, and decoding of the channels. We show that linear smoothing of channels is equivalent to robust smoothing of the signal features, where we make use of quadratic B-splines to generate the channels. The linear decoding from B-spline channels makes it possible to derive a robust error norm which is very similar to Tukey's biweight error norm. Channel smoothing is superior to iterative robust smoothing implementations like non-linear diffusion, bilateral filtering, and mean-shift approaches for four reasons: it has lower computational complexity, it is easy to implement, it chooses the global minimum error instead of the nearest local minimum, and it can also be used on non-linear spaces, such as orientation space. In the experimental part of the paper we compare channel smoothing and the three other approaches mentioned above for 2D orientation data.
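The three steps translate almost directly into code. The sketch below uses a centre-of-gravity decoding over the three channels around the strongest response, which is one simple form of linear decoding; the paper's exact decoding and error-norm analysis are not reproduced, and unit channel spacing is assumed:

    import numpy as np

    def b2(t):
        # Quadratic B-spline kernel, support [-1.5, 1.5]
        t = np.abs(t)
        return np.where(t < 0.5, 0.75 - t**2,
               np.where(t < 1.5, 0.5 * (1.5 - t)**2, 0.0))

    def encode(x, centers):
        # Overlapping channel responses, one row per sample
        return b2(x[:, None] - centers[None, :])

    def decode(ch, centers):
        # Centre of gravity around the strongest channel response
        i = np.clip(ch.argmax(axis=1), 1, len(centers) - 2)
        idx = i[:, None] + np.array([-1, 0, 1])
        c = np.take_along_axis(ch, idx, axis=1)
        return (c * centers[idx]).sum(axis=1) / c.sum(axis=1)

Robust smoothing of a signal then amounts to encoding it, linearly averaging the channel images, and decoding the result.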
@techreport{diva2:288553,
author = {Felsberg, Michael and Forssen, Per-Erik and Scharr, Hanno},
title = {{Efficient Robust Smoothing of Low-Level Signal Features}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2004},
type = {Other academic},
number = {LiTH-ISY-R, 2619},
address = {Sweden},
}
In this paper we present a new method to implement a robust estimator: B-spline channel smoothing. We show that linear smoothing of channels is equivalent to a robust estimator, where we make use of the channel representation based upon quadratic B-splines. The linear decoding from B-spline channels makes it possible to derive a robust error norm which is very similar to Tukey's biweight error norm. Using channel smoothing instead of iterative robust estimator implementations like non-linear diffusion, bilateral filtering, and mean-shift approaches is advantageous since channel smoothing is faster, it is easy to implement, it chooses the global minimum error instead of the nearest local minimum, and it can also be used on non-linear spaces, such as orientation space. As an application, we implemented orientation smoothing and compared it to the other three approaches.
@techreport{diva2:288549,
author = {Felsberg, Michael and Forssen, Per-Erik and Scharr, Hanno},
title = {{B-Spline Channel Smoothing for Robust Estimation}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2004},
type = {Other academic},
number = {LiTH-ISY-R, 2579},
address = {Sweden},
}
Most contemporary CT systems employ non-exact reconstruction methods. This treatise reports on how these methods can be transformed from non-exact to exact reconstruction methods by means of iterative post-processing. Compared to traditional algebraic reconstruction (ART), we expect much faster convergence (in theory quadratic), due to a much improved first guess and the fact that each iteration includes the same non-exact analytical reconstruction step as the first guess.
@techreport{diva2:288551,
author = {Danielsson, Per-Erik and Seger, Maria Magnusson},
title = {{Combining Fourier and iterative methods in computer tomography:
Analysis of an iteration scheme. The 2D-case}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2004},
type = {Other academic},
number = {LiTH-ISY-R, 2634},
address = {Sweden},
}
This report evaluates the stability of two image interest point detectors: star-pattern points and points based on the fourth order tensor. The Harris operator is also included for comparison. Different image transformations are applied, and the repeatability of points between a reference image and each of the transformed images is computed. The transforms are plane rotation, change in scale, change in view, and change in lighting conditions. We conclude that the result largely depends on the image content. The star-pattern points and the fourth order tensor model the image as locally straight lines, while the Harris operator is based on simple/non-simple signals. The two methods evaluated here perform equally well as or better than the Harris operator when the model is valid, and perform worse otherwise.
@techreport{diva2:288612,
author = {Johansson, Björn and Söderberg, Robert},
title = {{A Repeatability Test for Two Orientation Based Interest Point Detectors}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2004},
type = {Other academic},
number = {LiTH-ISY-R, 2606},
address = {Sweden},
}
In this paper we propose a new operator which combines advantages of monogenic scale-space and Gaussian scale-space, of the monogenic signal and the structure tensor. The gradient energy tensor (GET) defined in this paper is based on Gaussian derivatives up to third order using different scales. These filters are commonly available, separable, and have an optimal uncertainty. The response of this new operator can be used like the monogenic signal to estimate the local amplitude, the local phase, and the local orientation of an image, but it also allows measuring the coherence of image regions, as in the case of the structure tensor.
@techreport{diva2:288639,
author = {Felsberg, Michael},
title = {{The GET Operator}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2004},
type = {Other academic},
number = {LiTH-ISY-R, 2633},
address = {Sweden},
}
This report describes a view-based method for object recognition and estimation of object pose in still images. The method is based on feature vector matching and clustering. A set of interest points, in this case star-patterns, is detected and combined into pairs. A pair of patches, centered around each point in the pair, is extracted from a local orientation image. The patch orientation and size depend on the relative positions of the points, which makes them invariant to translation, rotation, and scale. Each pair of patches constitutes a feature vector. The method is demonstrated on a number of real images.
@techreport{diva2:257174,
author = {Johansson, Björn and Moe, Anders},
title = {{Patch-Duplets for Object Recognition and Pose Estimation}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2003},
type = {Other academic},
number = {LiTH-ISY-R, 2553},
address = {Sweden},
}
A Transaction Reproduction System (ARTSY) is a distributed system that enables secure transactions and reproductions of digital content over an insecure network. A field of application is reproductions of visual arts: A print workshop could for example use ARTSY to print a digital image that is located at a remote museum. The purpose of this master thesis project was to propose a specification for ARTSY and to show that it is technically feasible to implement it.
An analysis of the security threats in the ARTSY context was performed and a security model was developed. The security model was approved by a leading computer security expert. The security mechanisms chosen for the model were asymmetric cryptography, digital signatures, symmetric cryptography and a public key registry. A Software Requirements Specification was developed. It contains extra directives for image reproduction systems, but it can be used for an arbitrary type of reproduction system. A prototype of ARTSY was implemented using the Java programming language. The prototype uses XML to manage information and Java RMI to enable remote communication between its components. It was built as a platform independent system and it has been tested and proven to be operational on the Sun Solaris platform as well as the Win32 platform.
@mastersthesis{diva2:18935,
author = {Björk, Mårten and Max, Sofia},
title = {{ARTSY:
A Reproduction Transaction System}},
school = {Linköping University},
type = {{}},
year = {2003},
address = {Sweden},
}
This master's thesis develops an algorithm for tracking cars that is robust enough to handle turning cars. It is implemented in the image processing environment Image Processing Application Programming Interface (IPAPI) for use within the WITAS project.
Firstly, algorithms comparable with the one currently used in the WITAS project are studied. The focus is on how rotation, originating from the turning of the cars, affects tracking performance. The algorithms studied all perform an exhaustive search over a region close to the last known position of the tracked object to find a match. After this, an iterative algorithm is introduced, based on the idea that a car can only rotate, translate and change scale. The algorithm iteratively estimates the parameters describing this rotation, translation, and change of scale. The iterative process needs an initial parameter estimate that is accurate enough for the algorithm to converge. The developed algorithm is based on an earlier publication on the subject; however, the mathematical description and derivation are taken one step further than in that publication.
The iterative algorithm performs well under the assumption that the data fulfills some basic criteria. These demands comprise camera placement, template size, and how much the parameters may vary between two observations. The iterative algorithm is also potentially faster than exhaustive search methods, because few iterations are needed when the parameters change slowly. Better initial parameters should improve the stability and speed of convergence. Other suggestions that could give better performance are discussed, e.g., methods to better separate the target from its surroundings.
@mastersthesis{diva2:19030,
author = {Öberg, Per},
title = {{Tracking by Image Processing in a Real Time System}},
school = {Linköping University},
type = {{}},
year = {2003},
address = {Sweden},
}
A complete prototype system for measuring vehicle lateral position has been set up during the course of this master's thesis project. In the development of the software, images acquired from a backward-looking video camera mounted on the roof of the vehicle were used.
The problem of using computer vision to measure lateral position can be divided into road marking detection and lateral position extraction. Since the strongest characteristics of a road marking image are the edges of the road markings, the road marking detection step is based on edge detection. For the detection of the straight edge lines, a Hough-based method was chosen. Due to peak spreading, detecting the correct peak in Hough space proved difficult. A flexible Hough peak detection algorithm was therefore developed, based on an adaptive window that takes peak spreading into account. The road marking candidate found by the system is verified before the lateral position data is generated. Good performance of the road marking tracking algorithm was obtained by exploiting temporal correlation to update a search region within the image. A camera calibration made the extraction of real-world lateral position information and yaw angle data possible.
This vision-based method proved to be very accurate. The standard deviation of the error in the position detection is 0.012 m within an operating range of ±2 m from the image centre. For continuous road markings the rate of valid data is on average 96 %, whereas it drops to around 56 % for sections with intermittent road markings. The system performs well during lane change manoeuvres, which is an indication that the system tracks the correct road marking. This prototype system is a robust and automatic measurement system, which will benefit VTI in its many driving behaviour research programs.
@mastersthesis{diva2:19311,
author = {Ågren, Elisabeth},
title = {{Lateral Position Detection Using a Vehicle-Mounted Camera}},
school = {Linköping University},
type = {{}},
year = {2003},
address = {Sweden},
}
This thesis describes and evaluates a number of algorithms for reducing fixed pattern noise in image sequences. Fixed pattern noise is the dominant noise component for many infrared detector systems, perceived as a superimposed pattern that is approximately constant for all image frames.
Primarily, methods based on estimation of the movement between individual image frames are studied. Using scene-matching techniques, global motion between frames can be successfully registered with sub-pixel accuracy. This allows each scene pixel to be traced along a path of individual detector elements. Assuming a static scene, differences in pixel intensities are caused by fixed pattern noise that can be estimated and removed.
The algorithms have been tested by using real image data from existing infrared imaging systems with good results. The tests include both a two-dimensional focal plane array detector and a linear scanning one-dimensional detector, in different scene conditions.
@mastersthesis{diva2:19078,
author = {Torle, Petter},
title = {{Scene-based correction of image sensor deficiencies}},
school = {Linköping University},
type = {{}},
year = {2003},
address = {Sweden},
}
By analyzing ISAR images, the characteristics of military platforms with respect to radar visibility can be evaluated. The method currently used to calculate the ISAR images, based on the Discrete-Time Fourier Transform (DTFT), requires large computational effort. This thesis investigates the possibility of replacing the DTFT with the Fast Fourier Transform (FFT). Such a replacement is not trivial, since the DTFT can compute a contribution anywhere along the spatial axis, while the FFT delivers output data on a fixed sampling grid, which requires subsequent interpolation. The interpolation leads to a difference in the ISAR image compared to the ISAR image obtained by the DTFT. On the other hand, the FFT is much faster. In this quality-and-time trade-off, the objective is to minimize the error while keeping high computational efficiency.
The FFT approach is evaluated by studying execution time and image error when generating ISAR images for an aircraft model in a controlled environment. The FFT method shows good results. The execution speed is increased significantly without any visible differences in the ISAR images. The speed-up factor depends on several parameters: the image size, the degree of zero-padding when calculating the FFT, and the number of frequencies in the input data.
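The trade-off can be illustrated in one dimension (the thesis works on 2D ISAR data): the DTFT evaluates the spectrum at arbitrary points, whereas the zero-padded FFT delivers a fixed grid that must be interpolated.

    import numpy as np

    def dtft_at(x, freqs):
        # Exact spectrum samples at arbitrary normalised frequencies
        n = np.arange(len(x))
        return np.array([(x * np.exp(-2j * np.pi * f * n)).sum() for f in freqs])

    def fft_interp_at(x, freqs, zero_pad=8):
        # Zero-padded FFT plus linear interpolation: much faster, with
        # an interpolation error that shrinks as zero_pad grows
        N = zero_pad * len(x)
        X = np.fft.fft(x, N)
        grid = np.arange(N) / N
        f = np.mod(freqs, 1.0)
        return np.interp(f, grid, X.real) + 1j * np.interp(f, grid, X.imag)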
@mastersthesis{diva2:19402,
author = {Dahlbäck, Niklas},
title = {{Implementation of a fast method for reconstruction of ISAR images}},
school = {Linköping University},
type = {{}},
year = {2003},
address = {Sweden},
}
In this paper we address the problem of appropriately representing the intrinsic dimensionality of image neighborhoods. This dimensionality describes the degrees of freedom of a local image patch and it gives rise to some of the most often applied corner and edge detectors. It is common to categorize the intrinsic dimensionality (iD) into three distinct cases: i0D, i1D, and i2D. Real images, however, contain combinations of all three dimensionalities, which has to be taken into account by a continuous representation. Based on considerations of the structure tensor, we derive a cone-shaped iD-space which leads to a probabilistic point of view on the estimation of intrinsic dimensionality.
@techreport{diva2:288326,
author = {Felsberg, Michael and Kruger, Norbert},
title = {{A Probabilistic Definition of Intrinsic Dimensionality for Images}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2003},
type = {Other academic},
number = {LiTH-ISY-R, 2520},
address = {Sweden},
}
The use of linear filters, i.e. convolutions, inevitably introduces dependencies in the uncertainties of the filter outputs. Such non-vanishing covariances appear both between different positions and between the responses from different filters (even at the same position). This report describes how these covariances between the outputs of linear filters can be computed. We then examine the induced covariance matrices for some typical 1D and 2D filters. Finally, the total noise reduction properties are examined.
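For white input noise, the covariance in question is a plain correlation of the filter kernels. A 1D sketch (the report treats the 2D case analogously):

    import numpy as np

    def filter_output_covariance(h1, h2, shift=0, sigma2=1.0):
        # Cov(y1[n], y2[n + shift]) = sigma2 * sum_k h1[k] * h2[k + shift]
        # for y_i = h_i * x, with x white noise of variance sigma2
        cov = 0.0
        for k in range(len(h1)):
            j = k + shift
            if 0 <= j < len(h2):
                cov += h1[k] * h2[j]
        return sigma2 * cov

    # e.g. outputs of a binomial smoother at neighbouring positions:
    # filter_output_covariance([0.25, 0.5, 0.25], [0.25, 0.5, 0.25], shift=1)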
@techreport{diva2:288311,
author = {Spies, Hagen},
title = {{Covariances of Linear Filter Outputs in Computer Vision}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2003},
type = {Other academic},
number = {LiTH-ISY-R, 2504},
address = {Sweden},
}
Image intensity gradients can be encoded in a 2-dimensional channel representation. This report discusses the computation of such gradient channel matrices and what information can be extracted from them. In particular, this representation makes it possible to distinguish multiple orientations and magnitudes within a single representation. It is shown that this can be used to recover orientation very accurately. This holds in particular near orientation discontinuities, where classical orientation estimation fails.
@techreport{diva2:288613,
author = {Spies, Hagen},
title = {{Gradient Channel Matrices for Orientation Estimation}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2003},
type = {Other academic},
number = {LiTH-ISY-R, 2540},
address = {Sweden},
}
The channel representation is a simple yet powerful representation of scalars and vectors. It is especially suited for representation of several scalars at the same time without mixing them up.
This report is partly intended to serve as a simple illustration of the channel representation. The report shows how the channels can be used to represent multiple orientations in two dimensions. The idea is to make a channel representation of the local orientation angle computed from the image gradient. The representation basically becomes an orientation histogram with overlapping bins.
The channel histogram is compared with the orientation tensor, which is another representation of orientation. The performance is comparable to that of tensors in the simple signal case, but decreases slightly with an increasing number of channels. The channel histogram outperforms the tensors on non-simple signals.
@techreport{diva2:257179,
author = {Johansson, Björn},
title = {{Representing Multiple Orientations in 2D with Orientation Channel Histograms}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2002},
type = {Other academic},
number = {LiTH-ISY-R, 2475},
address = {Sweden},
}
Turning solar collectors, heliostats, are certainly not a new idea; they have been explored for at least two decades. Projects on this subject have resulted in constructions that are more or less realistic from a commercial point of view. Far too often the technical goals have had higher priority than the economic ones, with the result that few constructions can compete with conventional, fixed solar collectors. In this project the economic issues have been given high priority, without lowering the demands on reliability. The system has been given the following mechanical and electronic properties: one-axis movement, a fixed heat-carrying fluid system, microcomputer-controlled movement, and automatic protection from overheating. Given the development of digital technology, with falling prices for advanced semiconductors as a consequence, the conclusion is that the prerequisites for this concept will be even better in the future. The result of this thesis is a heliostat function that increases the energy gain by up to 40% compared to a field of MaReCo collectors without this function, while the cost increases by only 13%.
@mastersthesis{diva2:17448,
author = {Svensson, Mikael},
title = {{Utveckling av styrning till solföljande MaReCo-hybrid i Hammarby Sjöstad}},
school = {Linköping University},
type = {{}},
year = {2002},
address = {Sweden},
}
Face detection and pose estimation are two widely studied problems - mainly because of their use as subcomponents in important applications, e.g. face recognition. In this thesis I investigate a new approach to the general problem of object detection and pose estimation and apply it to faces. Face detection can be considered a special case of this general problem, but is complicated by the fact that faces are non-rigid objects. The basis of the new approach is the use of scale and orientation invariant feature structures - feature triplets - extracted from the image, as well as a biologically inspired associative structure which maps from feature triplets to desired responses (position, pose, etc.). The feature triplets are constructed from curvature features in the image and coded in a way to represent distances between major facial features (eyes, nose and mouth). The final system has been evaluated on different sets of face images.
@mastersthesis{diva2:17324,
author = {Isaksson, Marcus},
title = {{Face Detection and Pose Estimation using Triplet Invariants}},
school = {Linköping University},
type = {{}},
year = {2002},
address = {Sweden},
}
The purpose of this master's thesis is to evaluate whether it is feasible to use the panchromatic band of Landsat 7 in order to improve the spatial resolution of colour images. The images are to be used as texture in visual databases for flight simulators, and for this reason it is important that the fusion preserves natural colours.
A number of methods for fusing panchromatic and multispectral images are discussed. Four of them are implemented and evaluated. The result is that standard methods such as HSI substitution are not suitable for this purpose, since they do not preserve natural colours. However, if only the high frequencies of the panchromatic image are used, the resolution can be improved without noticeable colour distortion.
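The colour-preserving alternative, high-frequency injection, fits in a few lines (a sketch; registration, band weighting and radiometric scaling are ignored):

    import numpy as np
    from scipy import ndimage as ndi

    def highpass_fusion(ms_up, pan, sigma=2.0):
        # ms_up: multispectral image upsampled to the pan grid, (H, W, 3)
        # pan:   panchromatic band, (H, W); both scaled to [0, 1]
        detail = pan - ndi.gaussian_filter(pan, sigma)  # high frequencies only
        return np.clip(ms_up + detail[..., None], 0.0, 1.0)

Because only high-pass detail is added, the low-frequency colour balance of the multispectral bands, and hence the perceived natural colours, is left untouched.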
@mastersthesis{diva2:17912,
author = {Molin, Sara},
title = {{Förbättring av upplösningen i Landsat 7-bilder med hjälp av bildfusion}},
school = {Linköping University},
type = {{}},
year = {2002},
address = {Sweden},
}
This thesis presents a 3D semi-automatic segmentation technique for extracting the lumen surface of the Carotid arteries, including the bifurcation, from 3D and 4D ultrasound examinations.
Ultrasound images are inherently noisy. Therefore, to aid the inspection of the acquired data, an adaptive edge-preserving filtering technique is used to reduce the generally high noise level. The segmentation process starts with edge detection using a recursive and separable 3D Monga-Deriche-Canny operator. To reduce the computation time needed for the segmentation process, a seeded region growing technique is used to make an initial model of the artery. The final segmentation is based on the inflatable balloon model, which deforms the initial model to fit the ultrasound data. The balloon model is implemented with the finite element method.
The segmentation technique produces 3D models that are intended as pre-planning tools for surgeons. The results from a healthy person are satisfactory and the results from a patient with stenosis seem rather promising. A novel 4D model of wall motion of the Carotid vessels has also been obtained. From this model, 3D compliance measures can easily be obtained.
@mastersthesis{diva2:17818,
author = {Mattsson, Per and Eriksson, Andreas},
title = {{Segmentation of Carotid Arteries from 3D and 4D Ultrasound Images}},
school = {Linköping University},
type = {{}},
year = {2002},
address = {Sweden},
}
This Master’s thesis studies the possibility of using image processing as a tool to facilitate vine management, in particular shoot counting and assessment of the grapevine canopy. Both are areas where manual inspection is done today. The thesis presents methods for capturing images and segmenting different parts of a vine. It also presents and evaluates different approaches to shoot counting. Within canopy assessment, the emphasis is on methods to estimate canopy density. Other possible assessment areas are also discussed, such as canopy colour and the measurement of canopy gaps and fruit exposure. An example of a vine assessment system is given.
@mastersthesis{diva2:18665,
author = {Bjurström, Håkan and Svensson, Jon},
title = {{Assessment of Grapevine Vigour Using Image Processing}},
school = {Linköping University},
type = {{}},
year = {2002},
address = {Sweden},
}
The purpose of this thesis is to investigate the applicability of a certain model-based classification algorithm. The algorithm is centered around a flexible wireframe prototype that can instantiate a number of different vehicle classes, such as hatchbacks, pickups and buses, to mention a few. The parameters of the model are fitted using Newton minimization of the errors between model line segments and observed line segments. Furthermore, a number of methods for object detection based on motion are described and evaluated. Results from both experimental and real-world data are presented.
@mastersthesis{diva2:18561,
author = {Böckert, Andreas},
title = {{Vehicle detection and classification in video sequences}},
school = {Linköping University},
type = {{}},
year = {2002},
address = {Sweden},
}
This is a thesis written for a master's degree at the Computer Vision Laboratory, Linköping University. An abstract outer product is defined and used as a bridge to reach 2nd and 4th order tensors. Some applications of these in the geometric analysis of range data are discussed and illustrated. In idealized setups, simple geometric objects, like spheres or polygons, are successfully detected. Finally, the generalization to nth order tensors for storing and analysing geometric information is discussed.
@mastersthesis{diva2:18558,
author = {Eidehall, Andreas},
title = {{Tensor representation of 3D structures}},
school = {Linköping University},
type = {{}},
year = {2002},
address = {Sweden},
}
A new architecture for learning systems has been developed. A number of particular design features in combination result in a very high performance and excellent robustness. The architecture uses a monopolar channel information representation. The channel representation implies a partially overlapping mapping of signals into a higher-dimensional space, such that a flexible but continuous restructuring mapping can be made. The high-dimensional mapping introduces locality in the information representation, which is directly available in wavelets or filter outputs. Single level maps using this representation can produce closed decision regions, thereby eliminating the need for costly back-propagation. The monopolar property implies that data only utilizes one polarity, say positive values, in addition to zero, allowing zero to represent no information. This leads to an efficient sparse representation.
The processing mode of the architecture is association where the mapping of feature inputs onto desired state outputs is learned from a representative training set. The sparse monopolar representation together with locality, using individual learning rates, allows a fast optimization, as the system exhibits linear complexity. Mapping into multiple channels gives a strategy to use confidence statements in data, leading to a low sensitivity to noise in features. The result is an architecture allowing systems with a complexity of some hundred thousand features described by some hundred thousand samples to be trained in typically less than an hour. Experiments that demonstrate functionality and noise immunity are presented. The architecture has been applied to the design of hyper complex operations for view centered object recognition in robot vision.
@techreport{diva2:257178,
author = {Granlund, Gösta and Forss\'{e}n, Per-Erik and Johansson, Björn},
title = {{HiperLearn:
A High Performance Learning Architecture}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2002},
type = {Other academic},
number = {LiTH-ISY-R, 2409},
address = {Sweden},
}
This report describes how the choice of kernel affects a non-parametric density estimation. Methods for accurate localisation of peaks in the estimated densities are developed for Gaussian and cos² kernels. The accuracy and robustness of the peak localisation methods are studied with respect to noise, the number of samples, and interference between peaks. Although the peak localisation is formulated in the framework of non-parametric density estimation, the results are also applicable to associative learning with localised responses.
@techreport{diva2:288272,
author = {Forssen, Per-Erik},
title = {{Observations Concerning Reconstructions with Local Support}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2002},
type = {Other academic},
number = {LiTH-ISY-R, 2425},
address = {Sweden},
}
In this report we describe how an RGB component colour image may be expanded into a set of channel images, and how the original colour image may be reconstructed from these. We also demonstrate the effect of averaging on the channel images and how it differs from conventional averaging. Finally we demonstrate how boundaries can be detected as a change in the confidence of colour state.
@techreport{diva2:288277,
author = {Forssen, Per-Erik and Granlund, Gösta and Wiklund, Johan},
title = {{Channel Representation of Colour Images}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2002},
type = {Other academic},
number = {LiTH-ISY-R, 2418},
address = {Sweden},
}
In this paper we address the topics of scale-space and phase-based signal processing in a common framework. The involved linear scale-space is no longer based on the Gaussian kernel but on the Poisson kernel. The resulting scale-space representation is directly related to the monogenic signal, a 2D generalization of the analytic signal. Hence, the local phase arises as a natural concept in this framework which results in several advanced relationships that can be used in image processing.
@techreport{diva2:288275,
author = {Felsberg, Michael and Sommer, Gerald},
title = {{The Poisson Scale-Space: A Unified Approach to Phase-Based Image Processing in Scale-Space}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2002},
type = {Other academic},
number = {LiTH-ISY-R, 2453},
address = {Sweden},
}
In this paper we consider the channel representation based upon quadratic B-splines from a statistical point of view. Interpreting the channel representation as a kernel method for estimating probability density functions, we establish a channel algebra which makes it possible to perform basic algebraic operations on measurements directly in the channel representation. Furthermore, as a central point, we identify the smoothing of channel values with a robust estimator, or equivalently, a diffusion process.
@techreport{diva2:288621,
author = {Felsberg, Michael and Scharr, Hanno and Forssen, Per-Erik},
title = {{The B-Spline Channel Representation: Channel Algebra and Channel Based Diffusion Filtering}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2002},
type = {Other academic},
number = {LiTH-ISY-R, 2461},
address = {Sweden},
}
The structure tensor has been used mainly for representation of local orientation in spaces of arbitrary dimensions, where the eigenvectors represent the orientation and the corresponding eigenvalues indicate the type of structure which is represented. Apart from being local, the structure tensor may be referred to as "object centered" since it describes the corresponding structure relative to a local reference system. This paper proposes that the basic properties of the structure tensor can be extended to a tensor defined in a projective space rather than in a local Euclidean space. The result, the "projective tensor", is symmetric in the same way as the structure tensor, and also uses the eigensystem to carry the relevant information. However, instead of orientation, the projective tensor represents geometrical primitives such as points, lines, and planes (depending on the dimensionality of the underlying space). Furthermore, this representation has the useful property of mapping the operation of forming the affine hull of points and lines to tensor summation, e.g., the sum of two projective tensors which represent two points amounts to a projective tensor that represents the line which passes through the two points, etc. The projective tensor may be referred to as "view centered" since each tensor, which still may be defined on a local scale, represents a geometric primitive relative to a global image-based reference system. This implies that two such tensors may be combined, e.g., using summation, in a meaningful way over large regions.
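The point/line behaviour can be checked numerically. In the sketch below (2D projective space, so 3-component homogeneous coordinates), the sum of two point tensors has the line through the points as its range, and the line's dual coordinates appear as the zero-eigenvalue eigenvector:

```python
import numpy as np

# Two 2D points as homogeneous 3-vectors, each represented by the
# rank-1 projective tensor p p^T; tensor summation forms their affine hull.
p1 = np.array([1.0, 2.0, 1.0])
p2 = np.array([3.0, 0.0, 1.0])
T = np.outer(p1, p1) + np.outer(p2, p2)

# range(T) = span{p1, p2} = all homogeneous points on the line through them;
# the zero-eigenvalue eigenvector gives the line's dual coordinates.
vals, vecs = np.linalg.eigh(T)
line = vecs[:, np.argmin(np.abs(vals))]
print(line / np.linalg.norm(line))
print(np.cross(p1, p2) / np.linalg.norm(np.cross(p1, p2)))  # same up to sign
```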
@techreport{diva2:288635,
author = {Nordberg, Klas},
title = {{The structure tensor in projective spaces}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2002},
type = {Other academic},
number = {LiTH-ISY-R, 2424},
address = {Sweden},
}
Next generation helical cone-beam CT will feature pitches around 80 mm. It is predicted that reconstruction algorithms to be used in these machines, with still rather modest cone angles, may not necessarily be exact, but rather have an emphasis on simplicity and speed. The PI-methods are a family of non-exact algorithms, all of which are based on complete data capture with a detector collimated to the Tam-window, followed by rebinning to obliquely parallel ray geometry. The non-exactness is identified as inconsistency in the space-invariant one-dimensional ramp-filtering step. It is shown that this inconsistency can be reduced, resulting in significant improvement in image quality and increased tolerance for higher pitch and cone angle. A short theoretical background for the PI-methods is given, but the algorithms themselves are not given in any detail. A set of experiments on mathematical phantoms illustrates (among other things) how the amount of artefacts grows with increasing cone angles.
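The report analyses inconsistency in the ramp-filtering step; the generic space-invariant 1D ramp filter itself (not any of the PI-methods) can be sketched as follows, up to an overall scale factor.

```python
import numpy as np

def ramp_filter(proj, pad=2):
    """Apply a 1D ramp filter |w| along detector rows.

    proj: 2D array of rebinned parallel projections (one row per angle).
    Zero-padding reduces circular-convolution wrap-around. This is only
    the generic filtering step the report analyses, not a PI-method.
    """
    n = proj.shape[-1]
    m = pad * n
    w = np.abs(np.fft.fftfreq(m))            # the ramp |w| (arbitrary scale)
    F = np.fft.fft(proj, n=m, axis=-1)
    return np.fft.ifft(F * w, axis=-1).real[..., :n]
```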
@techreport{diva2:288610,
author = {Danielsson, Per-Erik and Seger, Maria Magnusson and Turbell, Henrik},
title = {{The PI-methods for Helical Cone-Beam Tomography}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2002},
type = {Other academic},
number = {LiTH-ISY-R, 2428},
address = {Sweden},
}
This Master's Thesis discusses the different trade-offs a programmer needs to consider when constructing image processing systems. First, an overview of the different alternatives available is given, followed by a focus on systems based on general hardware. General, in this case, means mass-market with a low price-to-performance ratio. The software environment is focused on UNIX, sometimes restricted to Linux, together with C, C++ and ANSI-standardized APIs.
@mastersthesis{diva2:303037,
author = {Nordlöv, Per},
title = {{Implementation Aspects of Image Processing}},
school = {Linköping University},
type = {{LiTH-ISY-Ex No. 3088}},
year = {2001},
address = {Sweden},
}
The aim of this master thesis is to classify the tree class from an image of a leaf with a computer vision classification system. We compare different descriptors that describe the different features of the leaves. We also look at different classification models and combine them with the descriptors to build a system that can classify the different tree classes.
@mastersthesis{diva2:303038,
author = {Söderkvist, Oskar},
title = {{Computer Vision Classification of Leaves from Swedish Trees}},
school = {Linköping University},
type = {{LiTH-ISY-Ex No. 3132}},
year = {2001},
address = {Sweden},
}
This report is a complement to the working document [1], where a sparse associative network is described. This report shows that the net learning rule in [1] can be viewed as the solution to a weighted least squares problem. This means that we can apply the theoretical framework of least squares problems, and compare the net rule with some other iterative algorithms that solve the same problem. The learning rule is compared with the gradient search algorithm and the RPROP algorithm in a simple synthetic experiment. The gradient rule has the slowest convergence, while the associative and the RPROP rules have similar convergence. The associative learning rule has a smaller initial error than the RPROP rule, though.
It is also shown in the same experiment that we get a faster convergence if we have a monopolar constraint on the solution, i.e. if the solution is constrained to be non-negative. The least squares error is a bit higher, but the norm of the solution is smaller, which gives a smaller interpolation error.
The report also discusses a generalization of the least squares model, which includes other known function approximation models.
[1] G. Granlund. Parallel Learning in Artificial Vision Systems: Working Document. Dept. EE, Linköping University, 2000.
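The monopolar-constraint observation can be reproduced with standard tools; a sketch using scipy's non-negative least squares solver (rather than the associative learning rule itself):

```python
import numpy as np
from scipy.optimize import nnls

rng = np.random.default_rng(0)
A = rng.random((100, 20))                          # samples x features
b = A @ rng.random(20) + 0.01 * rng.standard_normal(100)

x_ls = np.linalg.lstsq(A, b, rcond=None)[0]        # unconstrained least squares
x_mono, err = nnls(A, b)                           # monopolar (non-negative)

# the constrained residual is slightly higher, but the solution norm
# is smaller, which the report links to a smaller interpolation error
print(np.linalg.norm(A @ x_ls - b), np.linalg.norm(x_ls))
print(err, np.linalg.norm(x_mono))
```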
@techreport{diva2:257177,
author = {Johansson, Björn},
title = {{On Sparse Associative Networks:
A Least Squares Formulation}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2001},
type = {Other academic},
number = {LiTH-ISY-R, 2368},
address = {Sweden},
}
This report describes an idea based on the work in [1], where an algorithm for learning automatic representation of visual operators is presented. The algorithm in [1] uses canonical correlation to find a suitable subspace in which the signal is invariant to some desired properties. This report presents a related approach specially designed for classification problems. The goal is to find a subspace in which the signal is invariant within each class, and at the same time compute the class representation in that subspace. This algorithm is closely related to the one in [1], but less computationally demanding, and it is shown that the two algorithms are equivalent if we have an equal number of training samples for each class. Even though the new algorithm is designed for pure classification problems, it can still be used to learn visual operators, as will be shown in the experiment section. [1] M. Borga. Learning Multidimensional Signal Processing. PhD thesis, Linköping University, Sweden, SE-581 83 Linköping, 1998. Dissertation No 531, ISBN 91-7219-202-X.
@techreport{diva2:288281,
author = {Johansson, Björn},
title = {{On Classification: Simultaneously Reducing Dimensionality and Finding Automatic Representation using Canonical Correlation}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2001},
type = {Other academic},
number = {LiTH-ISY-R, 2375},
address = {Sweden},
}
This report starts with an introduction to the concepts active perception, reactive systems, and state dependency, and to fundamental aspects of perception such as the perceptual aliasing problem, and the number-of-percepts vs. number-of-states trade-off. We then introduce finite state machines, and extend them to accommodate active perception. Finally we demonstrate a state-transition mechanism that is applicable to autonomous navigation.
@techreport{diva2:288318,
author = {Forssen, Per-Erik},
title = {{Autonomous Navigation using Active Perception}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2001},
type = {Other academic},
number = {LiTH-ISY-R, 2395},
address = {Sweden},
}
This report describes a novel window matching technique. We perform window matching by transforming image data into sparse features, and applying a computationally efficient matching technique in the sparse feature space. The gain in execution time for the matching is roughly 10 times compared to full window matching techniques such as SSD, but the total execution time for the matching also involves an edge filtering step. Since the edge responses may be used for matching of several regions, the proposed matching technique is increasingly advantageous when the number of regions to keep track of increases, and when the size of the search window increases. The technique is used in a real-time ego-motion estimation system in the WITAS project. Ego-motion is estimated by tracking a set of structure points, i.e. regions that do not have the aperture problem. Comparisons with SSD with regard to speed and accuracy are made.
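A crude sketch of the idea (not the report's sparse feature representation): edge filtering is done once, and each candidate position is then scored on a small set of the template's strongest-edge pixels instead of the full window, as SSD would.

```python
import numpy as np

def edge_magnitude(img):
    gy, gx = np.gradient(img.astype(float))
    return np.hypot(gx, gy)

def match_sparse(img, tmpl, n_points=50):
    """Match a template using only its strongest-edge pixels."""
    e = edge_magnitude(tmpl)
    ys, xs = np.unravel_index(np.argsort(e, axis=None)[-n_points:], e.shape)
    th, tw = tmpl.shape
    H, W = img.shape
    best, best_pos = np.inf, None
    for r in range(H - th + 1):
        for c in range(W - tw + 1):
            # score only the sparse sample positions, not the full window
            score = np.sum((img[r + ys, c + xs] - tmpl[ys, xs]) ** 2)
            if score < best:
                best, best_pos = score, (r, c)
    return best_pos
```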
@techreport{diva2:288544,
author = {Forssen, Per-Erik},
title = {{Window Matching using Sparse Templates}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2001},
type = {Other academic},
number = {LiTH-ISY-R, 2392},
address = {Sweden},
}
This report defines the rank complement of a diagonalizable matrix (i.e. a matrix which can be brought to a diagonal form by means of a change of basis) as the interchange of the range and the null space. Given a diagonalizable matrix A there is in general no unique matrix Ac which has a range equal to the null space of A and a null space equal to the range of A; only matrices of full rank have a unique rank complement: the zero matrix. Consequently, the rank complement operation is not a distinct operation, but rather a characterization of any operation which makes an interchange of the range and the null space. One particular rank complement operation is introduced here, which eventually leads to an implementation of rank complement operations in terms of polynomials in A. The main result is that for each possible rank r of A there is a polynomial in A which evaluates to a matrix Ac which is a rank complement of A. The report provides explicit expressions for matrix polynomials which compute a rank complement of a symmetric matrix. These results are then generalized to the case of diagonalizable matrices. Finally, a Matlab function is described that implements a rank complement operation based on the results derived.
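The report's contribution is the polynomial construction; the sketch below instead computes a rank complement directly from the eigensystem of a symmetric matrix, which is the behaviour those polynomials reproduce.

```python
import numpy as np

def rank_complement(A, tol=1e-9):
    """Rank complement of a symmetric matrix via its eigensystem.

    Interchanges range and null space: eigenvectors with (near-)zero
    eigenvalue get eigenvalue 1, all others get 0.
    """
    vals, vecs = np.linalg.eigh(A)
    mask = (np.abs(vals) < tol).astype(float)
    return (vecs * mask) @ vecs.T      # projector onto the null space of A

p = np.array([1.0, 2.0, 3.0])
A = np.outer(p, p)                     # rank 1, range = span{p}
Ac = rank_complement(A)
print(np.allclose(Ac @ p, 0))          # range of A is now the null space
print(np.linalg.matrix_rank(Ac))       # 2 = dimension of A's null space
```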
@techreport{diva2:288596,
author = {Nordberg, Klas and Farnebäck, Gunnar},
title = {{Rank complement of diagonalizable matrices using polynomial functions}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2001},
type = {Other academic},
number = {LiTH-ISY-R, 2369},
address = {Sweden},
}
The purpose of this master's thesis was to study the possibility of using computer vision methods to detect and classify objects in the front passenger seat in a car. This work presents different approaches to solve this problem and evaluates the usefulness of each technique. The classification information should later be used to modulate the speed and the force of the airbag, to be able to provide each occupant with optimal protection and safety.
This work shows that computer vision has great potential to provide data which may be used to perform reliable occupant classification. The future choice of method depends on many factors, for example costs and requirements on the system from laws and car manufacturers. Furthermore, evaluation and tests of the methods in this thesis, other methods, the ABE approach and post-processing of the results should be made before a reliable classification algorithm can be written.
@mastersthesis{diva2:303034,
author = {Klomark, Marcus},
title = {{Occupant Detection using Computer Vision}},
school = {Linköping University},
type = {{LiTH-ISY-Ex No. 3026}},
year = {2000},
address = {Sweden},
}
This survey contains links to and facts about a number of current projects on content-based search in image databases around the world. The main focus is on what kinds of image features are used, but also on the user interface and the user's possibility to interact with the system (i.e. what 'visual language' is used).
@techreport{diva2:257176,
author = {Johansson, Björn},
title = {{A Survey on:
Contents Based Search in Image Databases}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2000},
type = {Other academic},
number = {LiTH-ISY-R, 2215},
address = {Sweden},
}
Some image patterns, e.g. circles, hyperbolic curves, star patterns etc., can be described in a compact way using local orientation. The features mentioned above are part of a family of patterns called rotational symmetries. This theory can be used to detect image patterns from the local orientation in double angle representation of an image. Some of the rotational symmetries were originally described in terms of the local orientation, without being designed to detect a certain feature. The question is then: given a description in double angle representation, what kind of image features does this description correspond to? This 'inverse', or backprojection, is not unambiguous - many patterns have the same local orientation description. This report answers this question for the case of rotational symmetries and also for some other descriptions.
@techreport{diva2:288305,
author = {Johansson, Björn},
title = {{Backprojection of Some Image Symmetries Based on a Local Orientation Description}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2000},
type = {Other academic},
number = {LiTH-ISY-R, 2311},
address = {Sweden},
}
@techreport{diva2:288331,
author = {Granlund, Gösta H.},
title = {{Context Controllable Linkage Models}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2000},
type = {Other academic},
number = {LiTH-ISY-R, 2238},
address = {Sweden},
}
@techreport{diva2:288280,
author = {Granlund, Gösta H.},
title = {{Learning Through Response-Driven Association}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2000},
type = {Other academic},
number = {LiTH-ISY-R, 2237},
address = {Sweden},
}
@techreport{diva2:288276,
author = {Granlund, Gösta H.},
title = {{Low Level Image Interpretation Using Associative Mapping}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2000},
type = {Other academic},
number = {LiTH-ISY-R, 2239},
address = {Sweden},
}
@techreport{diva2:288317,
author = {Granlund, Gösta},
title = {{The Dichotomy of Strategies for Spatial-Cognitive Information Processing}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2000},
type = {Other academic},
number = {LiTH-ISY-R, 2241},
address = {Sweden},
}
This report describes a technique to detect curvature. The technique uses local polynomial fitting on a local orientation description of an image. The idea is based on the theory of rotational symmetries, which describes curvature, circles, star patterns etc. The local polynomial fitting is shown to be equivalent to calculating partial derivatives on a lowpass version of the local orientation. The new method can therefore be implemented very efficiently, both in the single-scale case and in the multi-scale case.
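A rough sketch of the pipeline under assumed filters (Sobel gradients, Gaussian lowpass); the report's rotational-symmetry coefficients are not reproduced here, only the derivative-of-lowpassed-orientation structure.

```python
import numpy as np
from scipy.ndimage import gaussian_filter, sobel

def curvature_response(img, sigma=2.0):
    """Derivatives of a lowpass-filtered double-angle orientation image.

    z = (gx + i*gy)^2 is the double-angle local orientation description;
    responses are strong where the local orientation varies, e.g. at
    curves, circles and star patterns.
    """
    gx = sobel(img.astype(float), axis=1)
    gy = sobel(img.astype(float), axis=0)
    z = (gx + 1j * gy) ** 2
    z_lp = gaussian_filter(z.real, sigma) + 1j * gaussian_filter(z.imag, sigma)
    zx = sobel(z_lp.real, axis=1) + 1j * sobel(z_lp.imag, axis=1)
    zy = sobel(z_lp.real, axis=0) + 1j * sobel(z_lp.imag, axis=0)
    return np.hypot(np.abs(zx), np.abs(zy))
```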
@techreport{diva2:288546,
author = {Johansson, Björn},
title = {{Curvature Detection using Polynomial Fitting on Local Orientation}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2000},
type = {Other academic},
number = {LiTH-ISY-R, 2312},
address = {Sweden},
}
@techreport{diva2:288548,
author = {Granlund, Gösta H.},
title = {{Channel Representation of Information}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2000},
type = {Other academic},
number = {LiTH-ISY-R, 2236},
address = {Sweden},
}
This report describes the principles of an algorithm developed within the WITAS project. The goal of the WITAS project is to build an autonomous helicopter that can navigate autonomously, using differential GPS, GIS-data of the underlying terrain (elevation models and digital orthophotographs) and a video camera. Using differential GPS and other non-visual sensory equipment, the system is able to obtain crude estimates of its position and heading direction. These estimates can be refined by matching of camera images and the on-board GIS-data. This refinement process, however, is rather time-consuming, and will thus only be made every once in a while. For real-time refinement of camera position and heading, the system will iteratively update the estimates using frame-to-frame correspondence only. In each frame a sparse set of image displacement estimates is calculated, and from these the perspective in the current image can be found. Using the calculated perspective and knowledge of the camera parameters, new values of camera position and heading can be obtained. The resultant camera position and heading can exhibit a slow drift if the original alignment was not perfect, and thus a corrective alignment with GIS-data should be performed once every minute or so.
@techreport{diva2:288566,
author = {Forssen, Per-Erik},
title = {{Updating Camera Location and Heading using a Sparse Displacement Field}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2000},
type = {Other academic},
number = {LiTH-ISY-R, 2318},
address = {Sweden},
}
This report describes an experimental still image coder that grew out of a project in the graduate course "Advanced Video Coding" in spring 2000. The project has investigated the idea of using local orientation histograms in fractal coding. Instead of performing a correlation-like grey-level matching of image regions, the block search is made by matching feature histograms of the block contents. The feature investigated in this report is local orientation, but in principle other features could be used as well. In its current state the coder does not outperform state-of-the-art still image coders, but the block-search strategy seems promising, and will probably prove useful in several other applications.
@techreport{diva2:288616,
author = {Forssen, Per-Erik and Johansson, Björn},
title = {{Fractal Coding by Means of Local Feature Histograms}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2000},
type = {Other academic},
number = {LiTH-ISY-R, 2295},
address = {Sweden},
}
@techreport{diva2:288619,
author = {Granlund, Gösta H.},
title = {{The Use of Dynamics to Establish Knowledge of Invariant Structure}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2000},
type = {Other academic},
number = {LiTH-ISY-R, 2240},
address = {Sweden},
}
One important problem in image analysis is the localization of a template in a larger image. Applications where the solution of this problem can be used include tracking, optical flow, and stereo vision. The matching method studied here solves this problem by defining a new similarity measurement between a template and an image neighborhood. This similarity is computed for all possible integer positions of the template within the image. The position for which we get the highest similarity is considered to be the match. The similarity is not necessarily computed using the original pixel values directly, but can of course be derived from higher level image features.
The similarity measurement can be computed in different ways, and the simplest approaches are correlation-type algorithms. Aschwanden and Guggenbühl [2] have done a comparison between such algorithms. One of the best and simplest algorithms they tested is normalized cross-correlation (NCC). Therefore this algorithm has been used for comparison with the PAIRS algorithm that is developed by the author and described in this text. It uses a completely different similarity measurement, based on sets of bits extracted from the template and the image.
This work is done within WITAS, which is a project dealing with UAVs (unmanned aerial vehicles). Two specific applications of the developed template matching algorithm have been studied:
- One application is tracking of cars in video sequences from a helicopter.
- The other one is computing optical flow in such video sequences in order to detect moving objects, especially vehicles on roads.
The video from the helicopter is in color (RGB), and this fact is used in the presented tracking algorithm. The PAIRS algorithm has been applied to these two applications and the results are reported.
A part of this text concerns a general approach to template matching, developed here, called Maximum Entropy Matching (MEM). The main idea of MEM is that the more data we compare on a computer, the longer it takes; therefore the data that we compare should have maximum average information, that is, maximum entropy. We will see that this approach can be used to create template matching algorithms which are on the order of 10 times faster than correlation (NCC) without decreasing the performance.
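The NCC baseline referred to above is standard and can be sketched directly; the PAIRS bit-set similarity is not specified in enough detail here to reproduce.

```python
import numpy as np

def ncc(window, tmpl):
    """Normalized cross-correlation between one image window and a template."""
    w = window - window.mean()
    t = tmpl - tmpl.mean()
    denom = np.sqrt((w * w).sum() * (t * t).sum())
    return (w * t).sum() / denom if denom > 0 else 0.0

def match_ncc(img, tmpl):
    """Exhaustive integer-position search for the highest NCC score."""
    th, tw = tmpl.shape
    H, W = img.shape
    scores = np.array([[ncc(img[r:r + th, c:c + tw], tmpl)
                        for c in range(W - tw + 1)]
                       for r in range(H - th + 1)])
    return np.unravel_index(scores.argmax(), scores.shape)
```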
@techreport{diva2:288327,
author = {Lundberg, Frans},
title = {{Maximum Entropy Matching: An Approach to Fast Template Matching}},
institution = {Linköping University, Department of Electrical Engineering},
year = {2000},
type = {Other academic},
number = {LiTH-ISY-R, 2313},
address = {Sweden},
}
This thesis investigates the possibilities of using GIS (Geographic Information System) data with an airborne autonomous vehicle developed in the WITAS project. Available for the thesis are high resolution (0.16 meter sample interval) aerial photographs over Stockholm, and vector data in a common GIS format containing all roads in the Stockholm area.
A method for removing cars from aerial photographs is presented, using the filtering method normalized convolution, originally developed for filtering uncertain and incomplete data. By setting the certainty to zero over the cars, this data is disregarded in the filtering process, resulting in an image without cars. This method is further improved by choosing an anisotropic applicability function, resulting in a filtering that preserves structures oriented in certain directions.
The available vector data is investigated with regard to its use in a simulator for vehicle movement, and is found to be missing much of the essential information needed in such a simulator. A new data format better suited to these requirements is created, using the extensible markup language (XML), which generates a human-readable data format and can use existing parsers to make the implementation simpler. The result is a somewhat complex, but highly general data format that can accurately express almost any type of road and intersection. Cars can follow arbitrary paths in the road database and move with a smooth motion suitable for use as input to image processing equipment. The simulator does not allow any dynamic behaviour such as changing speeds, starting or stopping, or interaction between cars, such as overtaking or intelligent behavior in intersections.
In the airborne vehicle, a mapping from pixels in a camera image (like the ones output from the simulator) to locations in the road database is needed. This is an inverse mapping with respect to the visualization described above. It gives important information to a car tracking system regarding the probable movement of cars, and also makes it possible to determine whether a car breaks traffic regulations. A mapping of this kind is created using a simplified form of ray tracing known as ray casting, together with space partitioning methods used to vastly improve efficiency.
All the above-mentioned tasks are implemented using C++ and object-oriented methods, giving maintainable and extendable code suited to a quickly changing research area. The interface to the simulator is designed to be compatible with the existing simulation software used in the WITAS project. Visualization is done through the OpenGL graphics library, providing realistic effects such as lighting and shading.
@mastersthesis{diva2:303032,
author = {Langemark, Stefan},
title = {{GIS in a simulator environment and efficient inverse mapping of roads}},
school = {Linköping University},
type = {{LiTH-ISY-Ex No. 2090}},
year = {1999},
address = {Sweden},
}
We explore the use of colour for interpretation of unstructured off-road scenes. The aim is to extract driveable areas for use in an autonomous off-road vehicle in real time. The terrain is an unstructured tropical jungle area with vegetation, water and red mud roads.
We show that hue is both robust to changing lighting conditions and an important feature for correctly interpreting this type of scene. We believe that our method can also be deployed in other types of terrain, with minor changes, as long as the terrain is coloured and well saturated.
Only 2D information is processed at the moment, but we aim at extending the method to also treat 3D information, by the use of stereo vision or motion.
@mastersthesis{diva2:303033,
author = {Bergquist, Urban},
title = {{Colour Vision and Hue for Autonomous Vehicle Guidance}},
school = {Linköping University},
type = {{LiTH-ISY-Ex No. 2091}},
year = {1999},
address = {Sweden},
}
Experience from earlier trials at Korsnäs AB shows that it is very difficult to predict mathematically what happens during the production of pulp in a continuous digester.
The goal of this master's thesis was to investigate the possibilities of using neural networks to facilitate the process control by predicting the lignin content of the pulp three and a half hours before the chips in question are fully cooked.
Because of the time delay between different sensor signals, which varies with the production rate, the problem was solved with one simple, local model per production rate. All the constituent models were minimized with respect to both the number of nodes in the hidden layer and the number of inputs, which gave a final solution with four simple models built from feed-forward neural networks, each with one hidden layer containing three nodes.
In the end, the prediction of the lignin content showed good properties with respect to how well it follows the real kappa number analyser.
@mastersthesis{diva2:303022,
author = {Stewing, Robert},
title = {{Parameterprediktering med multipla sammansatta lokala neuronnätsbaserade modeller vid framställning av pappersmassa}},
school = {Linköping University},
type = {{LiTH-ISY-Ex No. 1991}},
year = {1999},
address = {Sweden},
}
Automated storage systems often rely on the positions of the pallets being known with high precision. In this thesis, a turnable camera mounted on the robot has been used for handling the situation of approximately known pallet positions. The robot is given the approximate location of a pallet, and its objective is to locate the pallet with a precision that is high enough to be able to approach it from the correct direction and then lift it. For this, a precision of a few centimetres in each direction is needed.
A system for locating the pallet from single images, based on rotational symmetry filters, has been developed, and a simple program for controlling the robot has been implemented. These could very well be extended and improved, e.g. by considering multiple images and improving the path planning.
The main part of the thesis deals with the image processing part. Other parts of the project, apart from the controller, include implementation of servers controlling the camera and the frame grabber.
Some tests have been made, which show fairly promising results.
@mastersthesis{diva2:303029,
author = {Roll, Jakob},
title = {{A System for Visual-Based Automated Storage Robots}},
school = {Linköping University},
type = {{LiTH-ISY-Ex No. 2053}},
year = {1999},
address = {Sweden},
}
Computer vision systems used in autonomous mobile vehicles are typically linked to higher-level deliberation processes. One important aspect of this link is how to connect, or anchor, the symbols used at the higher level to the objects in the vision system that these symbols refer to. Anchoring is complicated by the fact that the vision data are inherently affected by uncertainty. We propose an anchoring technique that uses fuzzy sets to represent the uncertainty in the perceptual data. We show examples where this technique allows a deliberative system to reason about the objects (cars) detected by a vision system on board an unmanned helicopter, in the framework of the WITAS project.
@techreport{diva2:288592,
author = {Andersson, Thord and Coradeschi, Silvia and Saffiotti, Alessandro},
title = {{Fuzzy matching of visual cues in an unmanned airborne vehicle}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1999},
type = {Other academic},
number = {},
address = {Sweden},
}
@techreport{diva2:288602,
author = {Reed, Todd},
title = {{A Baseline System for Image and Map Registration using Sparse Hierarchical Features}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1999},
type = {Other academic},
number = {LiTH-ISY-R, 2138},
address = {Sweden},
}
In this report, mainly three different problems are considered. The first problem considered is how to filter position data of vehicles. To do so the vehicles have to be tracked. This is done with Kalman filters. The second problem considered is how to control a camera to keep a vehicle in the center of the image, under three different conditions. This is mainly solved with a Kalman filter. The last problem considered is how to use the color of the vehicles to make classification of them more robust. Some suggestions on how this might be done are given. However, no really good method to do this has been found.
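The tracking part rests on standard Kalman filtering; a generic constant-velocity sketch (the noise covariances are assumed values, not the thesis's):

```python
import numpy as np

# Constant-velocity Kalman filter for 2D position tracking.
dt = 1.0
F = np.array([[1, 0, dt, 0], [0, 1, 0, dt],
              [0, 0, 1, 0], [0, 0, 0, 1]], dtype=float)  # state transition
H = np.array([[1, 0, 0, 0], [0, 1, 0, 0]], dtype=float)  # observe position only
Q = 0.01 * np.eye(4)   # process noise (assumed)
R = 1.0 * np.eye(2)    # measurement noise (assumed)

def kalman_step(x, P, z):
    # predict
    x = F @ x
    P = F @ P @ F.T + Q
    # update with position measurement z
    S = H @ P @ H.T + R
    K = P @ H.T @ np.linalg.inv(S)
    x = x + K @ (z - H @ x)
    P = (np.eye(4) - K @ H) @ P
    return x, P

x, P = np.zeros(4), 10.0 * np.eye(4)     # state: [px, py, vx, vy]
for t in range(20):
    z = np.array([1.0 * t, 0.5 * t]) + np.random.randn(2) * 0.5
    x, P = kalman_step(x, P, z)
print(x[:2], x[2:])                       # filtered position and velocity
```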
@mastersthesis{diva2:530596,
author = {Moe, Anders},
title = {{Investigations in Tracking and Colour Classification}},
school = {Linköping University},
type = {{}},
year = {1998},
address = {Sweden},
}
A recursive method to condense general multidimensional FIR filters into a sequence of simple kernels with mainly one-dimensional extent has been worked out. Convolver networks adapted for 2, 3 and 4D signals are presented, and the performance is illustrated for spherically separable quadrature filters. The resulting filter responses are mapped to a non-biased tensor representation where the local tensor constitutes a robust estimate of both the shape and the orientation (velocity) of the neighbourhood. A qualitative evaluation of this General Sequential Filter concept results in no detectable loss in accuracy when compared to conventional FIR (Finite Impulse Response) filters, but the computational complexity is reduced by several orders of magnitude. For the examples presented in this paper the attained speed-up is 5, 25 and 300 times for 2D, 3D and 4D data respectively. The magnitude of the attained speed-up implies that complex spatio-temporal analysis can be performed using standard hardware, such as a powerful workstation, in close to real time. Due to the soft implementation of the convolver and the tree structure of the sequential filtering approach, the processing is simple to reconfigure for the outer as well as the inner (vector length) dimensionality of the signal. The implementation was made in AVS (Application Visualization System) using modules written in C.
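The source of the speed-up can be illustrated with an exactly separable kernel (a Gaussian here; the report's quadrature filters require the recursive decomposition itself):

```python
import numpy as np
from scipy.ndimage import convolve, convolve1d

img = np.random.rand(256, 256)
g = np.exp(-0.5 * (np.arange(-7, 8) / 2.0) ** 2)
g /= g.sum()

# one full 15x15 2D kernel: 225 multiply-adds per pixel
full = convolve(img, np.outer(g, g))
# a sequence of two 1D kernels: 2 x 15 = 30 multiply-adds per pixel
seq = convolve1d(convolve1d(img, g, axis=0), g, axis=1)
print(np.allclose(full[8:-8, 8:-8], seq[8:-8, 8:-8]))  # True in the interior
```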
@techreport{diva2:288295,
author = {Andersson, Mats and Wiklund, Johan and Knutsson, Hans},
title = {{Sequential Filter Trees for Efficient 2D 3D and 4D Orientation Estimation}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1998},
type = {Other academic},
number = {LiTH-ISY-R, 2070},
address = {Sweden},
}
This paper presents our general strategy for designing learning machines as well as a number of particular designs. The search for methods allowing a sufficient level of adaptivity is based on two main principles: 1. simple adaptive local models and 2. adaptive model distribution. Particularly important concepts in our work are mutual information and canonical correlation. Examples are given on learning feature descriptors, modeling disparity, synthesis of a global 3-mode model, and a setup for reinforcement learning of online video coder parameter control.
@techreport{diva2:288299,
author = {Knutsson, Hans and Borga, Magnus and Landelius, Tomas},
title = {{Learning Multidimensional Signal Processing}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1998},
type = {Other academic},
number = {LiTH-ISY-R, 2039},
address = {Sweden},
}
This report introduces a signal processing strategy for depth segmentation and scene reconstruction that incorporates occlusion as a natural component. The work aims to maximize the use of connectivity in the temporal domain under the condition that the scene is static and that the camera motion is known. An object behind the foreground is reconstructed using the fact that different parts of the object have been seen in different images in the sequence. One of the main ideas in the reported work is the use of a spatiotemporal certainty volume c(x) with the same dimension as the input spatiotemporal volume s(x), where c(x) is used as a 'blackboard' for rejecting already segmented image structures. The segmentation starts with searching for image structures in the foreground, eliminates their occluding influence, and then proceeds. Normalized convolution, which is a weighted least mean square technique for filtering data with varying spatial reliability, is used for all filtering. High spatial resolution near object borders is achieved, and only neighboring structures with similar depth support each other.
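Zeroth-order normalized convolution (normalized averaging) can be sketched as follows; the report applies the technique in a more general spatiotemporal setting.

```python
import numpy as np
from scipy.ndimage import convolve

def normalized_convolution(signal, certainty, applicability):
    """Zeroth-order normalized convolution (normalized averaging):
    a weighted-least-squares estimate (a * (c*s)) / (a * c), where the
    certainty c(x) is zero for missing or rejected data."""
    num = convolve(signal * certainty, applicability)
    den = convolve(certainty, applicability)
    return np.where(den > 1e-9, num / np.maximum(den, 1e-9), 0.0)

# reconstruct a smooth image from which 70% of the pixels were removed
img = np.fromfunction(lambda r, c: np.sin(r / 10) + np.cos(c / 10), (64, 64))
cert = (np.random.rand(64, 64) > 0.7).astype(float)
g = np.exp(-0.5 * (np.arange(-5, 6) / 2.0) ** 2)
print(np.abs(normalized_convolution(img, cert, np.outer(g, g)) - img).mean())
```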
@techreport{diva2:288324,
author = {Ulvklo, Morgan and Granlund, Gösta H. and Knutsson, Hans},
title = {{Adaptive Reconstruction using Multiple Views}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1998},
type = {Other academic},
number = {LiTH-ISY-R, 2036},
address = {Sweden},
}
@techreport{diva2:288634,
author = {Borga, Magnus and Knutsson, Hans},
title = {{An Adaptive Stereo Algorithm Based on Canonical Correlation Analysis}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1998},
type = {Other academic},
number = {LiTH-ISY-R, 2013},
address = {Sweden},
}
@techreport{diva2:288629,
author = {Granlund, Gösta},
title = {{Does Vision Inevitably Have to be Active?}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1998},
type = {Other academic},
number = {LiTH-ISY-R, 2068},
address = {Sweden},
}
Artificial neural networks (ANN) are a technique that has matured over the last ten years and is now found in more and more applications, such as reading of written text, linear programming, automatic control, expert systems, speech recognition and many kinds of classification problems [Zurada, 1992]. In this master's thesis we wanted to try to use ANNs in an industrial process where standard methods have not worked satisfactorily or have been difficult to apply. We found such a process in the production of pulp.
Producing pulp from wood requires a long and complicated process divided into several steps. One of these steps is the so-called cooking, where wood chips are broken down into fibres using high pressure and hot liquor. The cooking process is complex, runs for a long time (about 8 hours) and is affected by a large number of parameters, so great experience and knowledge are required to control it. Kværner Pulping Technologies in Karlstad, which designs digesters among other things, has developed a simulator for the cooking process in order to gain better insight into how the process works and consequently be able to control the cooking in a better way. The behaviour of the simulator depends on a number of so-called hidden parameters, which are a subset of the parameters assumed to affect the cooking process. These hidden parameters are difficult or impossible to measure and are therefore set to estimated values in the simulation. The corresponding hidden parameters in the real process, however, vary in an unknown way. They are affected partly by internal processes in the digester and partly by external causes; for example, wood chips of a different quality may be fed into the digester. This means that the simulator gives good simulations only for rather short periods, while the hidden parameters are approximately constant.
If the changes in the hidden parameters of the process could somehow be detected and transferred to the simulator, it could run "in parallel" with the cooking process. The simulator would in that case be an excellent complementary tool for the person controlling the cooking process, since he or she would get a better idea of what is happening, or has happened, in the process and thereby a better basis for control decisions. This presupposes that the simulator is good enough to capture the global evolution in the digester with sufficient precision under stationary parameter conditions.
As a first step towards this goal, this report investigates whether detection of changes in the hidden parameters of the simulator is possible using feed-forward ANNs and the resilient propagation learning algorithm.
The report is divided into 7 chapters. Chapter 2 treats the problem in more detail. Chapters 3 and 4 are of a general nature, describing the paper manufacturing process and what artificial neural networks actually are. Chapter 5 describes the different solution proposals considered and the results we have achieved. Conclusions and results are summarized in chapter 6. There is much more we would like to try and investigate; this future work is described in chapter 7. Last in the report are appendices 1 and 2, with details that we find relevant but too bulky to include in the main part of the report. Appendix 3 contains the program code we produced during the work.
@mastersthesis{diva2:302994,
author = {Andersson, Thord and Karlsson, Mikael},
title = {{Neuronnätsbaserad identifiering av processparametrar vid tillverkning av pappersmassa}},
school = {Linköping University},
type = {{LiTH-ISY-Ex No. 1709}},
year = {1997},
address = {Sweden},
}
Chapter 2 describes the concept of canonical correlation, which you have to know about in order to understand the continuing discussion.
Chapter 3 introduces the problem that was to be solved.
Chapters 4, 5 and 6 discuss three different suggestions for how to approach the problem. Each chapter begins with a section of experiments as a motivation for the approach. Then follows some theory and mathematical manipulations to structure the thoughts. The last sections contain discussions and suggestions concerning the approach.
Finally, chapter 7 contains a summary and a comparative discussion of the approaches.
@mastersthesis{diva2:303009,
author = {Johansson, Björn},
title = {{Multidimensional signal recognition, invariant to affine transformation and time-shift, using canonical correlation}},
school = {Linköping University},
type = {{LiTH-ISY-EX-1825}},
year = {1997},
address = {Sweden},
}
Segmentation is a process that separates objects in an image. In medical images, particularly image volumes, the field of application is wide. For example, 3D visualisations of the anatomy could benefit enormously from segmentation. The aim of this thesis is to construct a segmentation tool.
The project consists of three main parts. First, a survey of the actual need for segmentation in medical image volumes was carried out. Then a unique three-step model for a segmentation tool was implemented, tested and evaluated.
The first step of the segmentation tool is a seed-growing method that uses the intensity and an orientation tensor estimate to decide which voxels are part of the object. The second step uses an active contour, a deformable “balloon”. The contour is shrunk to fit the segmented border from the first step, yielding a surface suitable for visualisation. The last step consists of letting the contour reshape according to the orientation tensor estimate.
The user evaluation establishes the usefulness of the tool. The model is flexible and well adapted to the users' requests. For unclear objects the segmentation may fail, but the cause is mostly poor image quality. Even though much work remains to be done on the second and third parts of the tool, the results are most promising.
@mastersthesis{diva2:303019,
author = {Lundström, Claes},
title = {{Segmentation of Medical Image Volumes}},
school = {Linköping University},
type = {{LiTH-ISY-Ex No. 1864}},
year = {1997},
address = {Sweden},
}
In this report, the principles of man-made object detection in satellite images are investigated. An overview of terminology and of how the detection problem is usually solved today is given. A three-level system to solve the detection problem is proposed. The main branches of this system handle road and city detection, respectively. To achieve data source flexibility, the Logical Sensor notion is used to model the low-level system components. Three Logical Sensors have been implemented and tested on Landsat TM and SPOT XS scenes. These are: BDT (Background Discriminant Transformation) to construct a man-made object property field; local orientation for texture estimation and road tracking; and texture estimation using local variance and variance of local orientation. A gradient magnitude measure for road seed generation has also been tested.
@mastersthesis{diva2:303014,
author = {Forss\'{e}n, Per-Erik},
title = {{Detection of Man-made Objects in Satellite Images}},
school = {Linköping University},
type = {{LiTH-ISY-Ex No. 1852}},
year = {1997},
address = {Sweden},
}
@techreport{diva2:288304,
author = {Ulvklo, Morgan and Uppsäll, Magnus},
title = {{Adaptive Reconstruction using Multiple Views - Results and Applications}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1997},
type = {Other academic},
number = {},
address = {Sweden},
}
@techreport{diva2:288560,
author = {Karlholm, Jörgen},
title = {{Tracking of occluded targets in head-up display sequences}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1997},
type = {Other academic},
number = {LiTH-ISY-R, 1993},
address = {Sweden},
}
This paper presents a novel algorithm for analysis of stochastic processes. The algorithm can be used to find the required solutions in the cases of principal component analysis (PCA), partial least squares (PLS), canonical correlation analysis (CCA) or multiple linear regression (MLR). The algorithm is iterative and sequential in its structure and uses on-line stochastic approximation to reach an equilibrium point. A quotient between two quadratic forms is used as an energy function and it is shown that the equilibrium points constitute solutions to the generalized eigenproblem.
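For the CCA special case, the generalized eigenproblem can be set up and solved with standard dense routines (a batch sketch, not the paper's on-line stochastic approximation; the data construction is illustrative):

```python
import numpy as np
from scipy.linalg import eigh

rng = np.random.default_rng(1)
n = 1000
x = rng.standard_normal((n, 3))
# y shares two dimensions with x, plus one independent dimension
y = np.c_[x[:, :2] @ rng.standard_normal((2, 2)), rng.standard_normal(n)]

# CCA as a generalized eigenproblem A v = r B v, with A holding the
# between-set covariance blocks and B the within-set ones
Cxx, Cyy = np.cov(x.T), np.cov(y.T)
Cxy = np.cov(x.T, y.T)[:3, 3:]
A = np.block([[np.zeros((3, 3)), Cxy], [Cxy.T, np.zeros((3, 3))]])
B = np.block([[Cxx, np.zeros((3, 3))], [np.zeros((3, 3)), Cyy]])
r, V = eigh(A, B)
print(r[::-1][:3])   # leading values ~ the canonical correlations (~1, 1, 0)
```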
@techreport{diva2:288565,
author = {Borga, Magnus and Landelius, Tomas and Knutsson, Hans},
title = {{A Unified Approach to PCA, PLS, MLR and CCA}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1997},
type = {Other academic},
number = {LiTH-ISY-R, 1992},
address = {Sweden},
}
To find a shape in an image, a technique called snakes or active contours can be used. An active contour is a curve that moves towards the sought-for shape in a way controlled by internal forces - such as rigidity and elasticity - and an image force. The image force should attract the contour to certain features, such as edges, in the image. This is done by creating an attractor image, which defines how strongly each point in the image should attract the contour.
In this thesis the extension to contours (surfaces) in three dimensional images is studied. Methods of representation of the contour and computation of the internal forces are treated.
Also, a new way of creating the attractor image, using the orientation tensor to detect planar structure in 3D images, is studied. The new method is not generally superior to those already existing, but still has its uses in specific applications.
During the project, it turned out that the main problem of active contours in 3D images was instability due to strong internal forces overriding the influence of the attractor image. The problem was solved satisfactorily by projecting the elasticity force on the contour’s tangent plane, which was approximated efficiently using sphere-fitting.
@mastersthesis{diva2:302987,
author = {Ahlberg, Jörgen},
title = {{Active Contours in Three Dimensions}},
school = {Linköping University},
type = {{LiTH-ISY-Ex No. 1708}},
year = {1996},
address = {Sweden},
}
This Master's Thesis addresses the problem of segmenting an image sequence with respect to the motion in the sequence. As a basis for the motion estimation, 3D orientation tensors are used. The goal of the segmentation is to partition the images into regions, characterized by having a coherent motion. The motion model is affine with respect to the image coordinates. A method to estimate the parameters of the motion model from the orientation tensors in a region is presented. This method can also be generalized to a large class of motion models.
Two segmentation algorithms are presented together with a postprocessing algorithm. All these algorithms are based on the competitive algorithm, a general method for distributing points between a number of regions, without relying on arbitrary threshold values. The first segmentation algorithm segments each image independently, while the second algorithm recursively takes advantage of the previous segmentation. The postprocessing algorithm stabilizes the segmentations of a whole sequence by imposing continuity constraints.
The algorithms have been implemented and the results of applying them to a test sequence are presented. Interesting properties of the algorithms are that they are robust to the aperture problem and that they do not require a dense velocity field.
It is finally discussed how the algorithms can be developed and improved. It is straightforward to extend the algorithms to base the segmentations on alternative or additional features, under not too restrictive conditions on the features.
@mastersthesis{diva2:302971,
author = {Farnebäck, Gunnar},
title = {{Motion-based segmentation of image sequences}},
school = {Linköping University},
type = {{LiTH-ISY-Ex No. 1596}},
year = {1996},
address = {Sweden},
}
This report documents work done at the request of the Swedish Defense Research Establishment. The studied problem is that of detecting point-shaped targets, i.e. targets whose only significant property is that of being very small, in a cluttered environment. Three approaches to the problem have been considered. The first one, based on motion compensation, was rejected at an early stage due to expected problems with robustness and computational demands. The second method, based on background modeling with principal components, turned out successful and has been studied in depth, including discussion of various extensions and improvements of the presented algorithm. Finally, a Wiener filter approach has also turned out successful, including an approximation with separable filters. The methods have been tested on sequences obtained by an IR sensor. While both the two latter approaches work well on the test sequences, the Wiener filter is simpler and computationally less expensive than the background modeling. On the other hand, the background modeling is likely to have better possibilities for extensions and improvements.
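A sketch of the background-modeling approach in its simplest batch form (a PCA subspace fitted to target-free frames, detection in the residual); the report's extensions and the Wiener-filter alternative are not reproduced here, and all names are illustrative.

```python
import numpy as np

def detect_small_targets(frames, new_frame, k=8):
    """Detect point-shaped targets as residuals from a PCA background model.

    frames: (n, H, W) stack of (target-free) background frames.
    The background subspace is spanned by the top-k principal components;
    anything the subspace cannot explain shows up in the residual image.
    """
    n, H, W = frames.shape
    X = frames.reshape(n, -1).astype(float)
    mean = X.mean(axis=0)
    _, _, Vt = np.linalg.svd(X - mean, full_matrices=False)
    basis = Vt[:k]                          # top-k principal components
    f = new_frame.reshape(-1) - mean
    residual = f - basis.T @ (basis @ f)    # remove the background part
    return np.abs(residual).reshape(H, W)   # peaks indicate candidate targets
```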
@techreport{diva2:288286,
author = {Farnebäck, Gunnar and Knutsson, Hans and Granlund, Gösta},
title = {{Detection of point-shaped targets}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1996},
type = {Other academic},
number = {LiTH-ISY-R, 1921},
address = {Sweden},
}
This paper presents a novel algorithm for finding the solution of the generalized eigenproblem where the matrices involved contain expectation values from stochastic processes. The algorithm is iterative and sequential in its structure and uses on-line stochastic approximation to reach an equilibrium point. A quotient between two quadratic forms is suggested as an energy function for this problem and is shown to have zero gradient only at the points solving the eigenproblem. Furthermore it is shown that the algorithm for the generalized eigenproblem can be used to solve three important problems as special cases. For a stochastic process the algorithm can be used to find the directions for maximal variance, covariance, and canonical correlation as well as their magnitudes.
@techreport{diva2:288332,
author = {Knutsson, Hans and Borga, Magnus and Landelius, Tomas},
title = {{Generalized Eigenproblem for Stochastic Process Covariances}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1996},
type = {Other academic},
number = {LiTH-ISY-R, 1916},
address = {Sweden},
}
Two new reinforcement learning algorithms are presented. Both use a binary tree to store simple local models in the leaf nodes and coarser global models towards the root. It is demonstrated that a meaningful partitioning into local models can only be accomplished in a fused space consisting of both input and output. The first algorithm uses a batch-like statistical procedure to estimate the reward functions in the fused space. The second one uses channel coding to represent the output and input vectors, allowing a simple iterative algorithm based on competing subsystems. The behaviors of both algorithms are illustrated in a preliminary experiment.
@techreport{diva2:288282,
author = {Landelius, Tomas and Borga, Magnus and Knutsson, Hans},
title = {{Reinforcement Learning Trees}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1996},
type = {Other academic},
number = {LiTH-ISY-R, 1828},
address = {Sweden},
}
A scheme for performing generalized convolutions is presented. A flexible convolver, which runs on standard workstations, has been implemented. It is designed for maximum throughput and flexibility. The implementation incorporates spatio-temporal convolutions with configurable vector combinations. It can handle general multilinear operations, i.e. tensor operations on multidimensional data of any order. The input data and the kernel coefficients can be of arbitrary vector length. The convolver is configurable for IIR filters in the time dimension. Other features of the implemented convolver are scattered kernel data, region of interest and subsampling. The implementation is done as a C-library and a graphical user interface in AVS (Application Visualization System).
@techreport{diva2:288320,
author = {Wiklund, Johan and Knutsson, Hans},
title = {{A Generalized Convolver}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1996},
type = {Other academic},
number = {LiTH-ISY-R, 1830},
address = {Sweden},
}
A number of success stories have been told where reinforcement learning has been applied to problems in continuous state spaces using neural nets or other sorts of function approximators in the adaptive critics. However, the theoretical understanding of why and when these algorithms work is inadequate. This is clearly exemplified by the lack of convergence results for a number of important situations. To our knowledge only two such results have been presented for systems in the continuous state space domain. The first is due to Werbos and is concerned with linear function approximation and heuristic dynamic programming. Here no optimal strategy can be found, which is why the result is of limited importance. The second result is due to Bradtke and deals with linear quadratic systems and quadratic function approximators. Bradtke's proof is limited to ADHDP and policy iteration techniques where the optimal solution is found by a number of successive approximations. This paper deals with greedy techniques, where the optimal solution is directly aimed for. Convergence proofs for a number of adaptive critics, HDP, DHP, ADHDP and ADDHP, are presented. Optimal controllers for linear quadratic regulation (LQR) systems can be found by standard techniques from control theory, but the assumptions made in control theory can be weakened if adaptive critic techniques are employed. The main point of this paper is, however, not to emphasize the differences but to highlight the similarities and by so doing contribute to a theoretical understanding of adaptive critics.
@techreport{diva2:288542,
author = {Landelius, Tomas and Knutsson, Hans},
title = {{Greedy adaptive critics for LPQ [i.e. LQR] problems:
Convergence Proofs}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1996},
type = {Other academic},
number = {LiTH-ISY-R, 1896},
address = {Sweden},
}
This paper reviews an existing algorithm for adaptive control based on explicit criterion maximization (ECM) and presents an extended version suited for reinforcement learning tasks. Furthermore, assumptions under which the algorithm converges to a local maximum of a long term utility function are given. Such convergence theorems are very rare for reinforcement learning algorithms working with continuous state and action spaces. A number of similar algorithms, previously suggested to the reinforcement learning community, are briefly surveyed in order to give the presented algorithm a place in the field. The relations between the different algorithms are exemplified by checking their consistency on a simple problem of linear quadratic regulation (LQR).
@techreport{diva2:288584,
author = {Landelius, Tomas and Knutsson, Hans},
title = {{Reinforcement Learning Adaptive Control and Explicit Criterion Maximization}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1996},
type = {Other academic},
number = {LiTH-ISY-R, 1829},
address = {Sweden},
}
This paper presents novel algorithms for finding the singular value decomposition (SVD) of a general covariance matrix by stochastic approximation; general in the sense that non-square, between-sets covariance matrices are also dealt with. For one of the algorithms, convergence is shown using results from stochastic approximation theory. Proofs of this sort, establishing both the point of equilibrium and its domain of attraction, have been reported very rarely for stochastic, iterative feature extraction algorithms.
@techreport{diva2:288273,
author = {Landelius, Tomas and Knutsson, Hans and Borga, Magnus},
title = {{On-Line Singular Value Decomposition of Stochastic Process Covariances}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1995},
type = {Other academic},
number = {LiTH-ISY-R, 1762},
address = {Sweden},
}
This paper presents a novel learning algorithm that finds the linear combination of one set of multi-dimensional variates that is the best predictor, and at the same time finds the linear combination of another set which is the most predictable. This relation is known as the canonical correlation and has the property of being invariant with respect to affine transformations of the two sets of variates. The algorithm successively finds all the canonical correlations beginning with the largest one. It is shown that canonical correlations can be used in computer vision to find feature detectors by giving examples of the desired features. When used on the pixel level, the method finds quadrature filters and when used on a higher level, the method finds combinations of filter output that are less sensitive to noise compared to vector averaging.
@techreport{diva2:288567,
author = {Knutsson, Hans and Borga, Magnus and Landelius, Tomas},
title = {{Learning Canonical Correlations}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1995},
type = {Other academic},
number = {LiTH-ISY-R, 1761},
address = {Sweden},
}
This paper presents an algorithm for estimation of local curvature from gradients of a tensor field that represents local orientation. The algorithm is based on an operator representation of the orientation tensor, which means that change of local orientation corresponds to a rotation of the eigenvectors of the tensor. The resulting curvature descriptor is a vector that points in the direction of the image in which the local orientation rotates anti-clockwise and the norm of the vector is the inverse of the radius of curvature. Two coefficients are defined that relate the change of local orientation with either curves or radial patterns.
@techreport{diva2:288599,
author = {Nordberg, Klas and Knutsson, Hans and Granlund, Gösta},
title = {{Local Curvature from Gradients of the Orientation Tensor Field}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1995},
type = {Other academic},
number = {LiTH-ISY-R, 1783},
address = {Sweden},
}
@techreport{diva2:288633,
author = {Wilson, Roland and Knutsson, Hans},
title = {{Seeing Things II}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1995},
type = {Other academic},
number = {LiTH-ISY-R, 1787},
address = {Sweden},
}
This paper addresses the idea of learning by reinforcement, within the theory of behaviorism. The reason for this choice is its generality, and especially that the reinforcement learning paradigm allows the design of systems which can improve their behavior beyond that of their teacher. The role of the teacher is to define the reinforcement function, which acts as a description of the problem the machine is to solve. Gained knowledge is represented by a behavior probability density function, which is approximated with a number of normal distributions stored in the nodes of a binary tree. It is argued that a meaningful partitioning into local models can only be accomplished in a fused space consisting of both stimuli and responses. Given a stimulus, the system searches for responses likely to result in highly reinforced decisions by treating the sum of the two normal distributions on each level in the tree as a distribution describing the system's behavior at that resolution. The resolution of the response, as well as the tree growing and pruning processes, is controlled by a random variable based on the difference in performance between two consecutive levels in the tree. This results in a system that will never be content but will indefinitely continue to search for better solutions.
@techreport{diva2:288270,
author = {Landelius, Tomas and Knutsson, Hans},
title = {{A Dynamic Tree Structure for Incremental Reinforcement Learning of Good Behavior}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1994},
type = {Other academic},
number = {LiTH-ISY-R, 1628},
address = {Sweden},
}
A robust, general and computationally simple reinforcement learning system is presented. It uses a channel representation which is robust and continuous. The accumulated knowledge is represented as a reward prediction function in the outer product space of the input and output channel vectors. Each computational unit generates an output simply by a vector-matrix multiplication, and the response can therefore be calculated fast. The response and a prediction of the reward are calculated simultaneously by the same system, which makes TD methods easy to implement if needed. Several units can cooperate to solve more complicated problems. A dynamic tree structure of linear units is grown in order to divide the knowledge space into a sufficient number of regions in which the reward function can be properly described. The tree continuously tests split and prune criteria in order to adapt its size to the complexity of the problem.
@techreport{diva2:288288,
author = {Borga, Magnus and Knutsson, Hans},
title = {{A Binary Competition Tree for Reinforcement Learning}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1994},
type = {Other academic},
number = {LiTH-ISY-R, 1623},
address = {Sweden},
}
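A minimal sketch of the outer-product construction described above, with many simplifications (scalar input and output, a single linear unit, no tree growing): values are encoded in overlapping cos^2 channels, the reward prediction is the bilinear form a^T W b, and the response is obtained by one vector-matrix multiplication. The task and all parameters are illustrative.

import numpy as np

def channels(val, centers, width=0.5):
    # Overlapping cos^2 channel encoding of a scalar (smooth, compact support).
    d = np.abs(val - centers) / width
    return np.where(d < 1.0, np.cos(np.pi * d / 2.0) ** 2, 0.0)

rng = np.random.default_rng(1)
in_c = np.linspace(-1, 1, 9)       # input channel centers
out_c = np.linspace(-1, 1, 9)      # output channel centers
W = np.zeros((9, 9))               # reward predictor in outer-product space

for step in range(20000):
    x = rng.uniform(-1, 1)
    a = channels(x, in_c)
    if rng.random() < 0.3:
        y = rng.uniform(-1, 1)                 # explore
    else:
        y = out_c[np.argmax(a @ W)]            # exploit: one matrix product
    b = channels(y, out_c)
    r = 1.0 - abs(y + x)                       # toy task: mirror the input
    # Delta rule: move the bilinear prediction a^T W b toward the reward.
    W += 0.1 * (r - a @ W @ b) * np.outer(a, b)

a = channels(0.6, in_c)
print(out_c[np.argmax(a @ W)])     # expect a response channel near -0.6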
A robust, fast and general method for estimation of object properties is proposed. It is based on a representation of these properties in terms of channels. Each channel represents a particular value of a property, resembling the activity of biological neurons. Furthermore, each processing unit, corresponding to an artificial neuron, is a linear perceptron which operates on outer products of input data. This implies a more complex space of invariances than in the first-order case, without abandoning linear theory. In general, the specific function of each processing unit has to be learned, and a fast and simple learning rule is presented. The channel representation, the processing structure and the learning rule have been tested on stereo image data showing a cube in various 3D positions and orientations. The system was able to learn a channel representation for the horizontal position, the depth, and the orientation of the cube, each property invariant to the other two.
@techreport{diva2:288329,
author = {Nordberg, Klas and Granlund, Gösta and Knutsson, Hans},
title = {{Representation and Learning of Invariance}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1994},
type = {Other academic},
number = {LiTH-ISY-R, 1552},
address = {Sweden},
}
@techreport{diva2:288308,
author = {Westin, Carl-Fredrik and Westelius, Carl-Johan and Wiklund, Johan and Knutsson, Hans and Granlund, Gösta},
title = {{ESPRIT Basic Research Action 7108, Vision as Process, DR.B.2:
Integration of Multi-level Control Loops and FOA}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1994},
type = {Other academic},
number = {},
address = {Sweden},
}
We apply the 3D-orientation tensor representation to construct an object tracking algorithm. 2D-line normal flow is estimated by computing the eigenvector associated with the largest eigenvalue of 3D (two spatial dimensions plus time) tensors with a planar structure. The object's true 2D velocity is computed by averaging tensors with consistent normal flows, generating a 3D line representation that corresponds to a 2D point in motion. Flow induced by camera rotation is compensated for by ignoring points with velocity consistent with the ego-rotation. A region-of-interest growing process based on motion consistency generates estimates of object size and position.
@techreport{diva2:288608,
author = {Karlholm, Jörgen and Westelius, Carl-Johan and Westin, Carl-Fredrik and Knutsson, Hans},
title = {{Object Tracking Based on the Orientation Tensor Concept}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1994},
type = {Other academic},
number = {LiTH-ISY-R, 1658},
address = {Sweden},
}
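The normal-flow step above has a compact expression in terms of the spatiotemporal structure tensor. A minimal sketch, assuming a single locally planar patch (a moving line/edge): the eigenvector of the largest eigenvalue is the plane normal (nx, ny, nt), from which the normal velocity follows.

import numpy as np

def normal_flow(volume):
    # volume: (T, H, W) spatiotemporal image stack.
    gt, gy, gx = np.gradient(volume.astype(float))
    g = np.stack([gx.ravel(), gy.ravel(), gt.ravel()])
    T = g @ g.T / g.shape[1]            # averaged 3-D orientation tensor
    w, V = np.linalg.eigh(T)
    nx, ny, nt = V[:, -1]               # eigenvector of the largest eigenvalue
    # For planar x-y-t structure, the velocity component along the spatial
    # normal is -nt / |(nx, ny)| in the direction (nx, ny):
    s = nx * nx + ny * ny
    return -nt * nx / s, -nt * ny / s

# Toy example: a smooth vertical edge translating one pixel per frame in x.
t, y, x = np.mgrid[0:16, 0:32, 0:32]
vol = 1.0 / (1.0 + np.exp(-(x - 8 - t)))
print(normal_flow(vol))                 # approximately (1.0, 0.0)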
Three-dimensional data processing is becoming more and more common. Typical operations include estimation of optical flow in video sequences and orientation estimation in 3-D MR images. This paper proposes an efficient approach to robust low-level feature extraction for 3-D image analysis. In contrast to many earlier algorithms, the methods proposed in this paper support the use of relatively complex models at the initial processing steps. The aim of this approach is to provide the means to handle complex events at the initial processing steps and to enable reliable estimates in the presence of noise. A limited basis filter set is proposed which forms a basis on the unit sphere and is related to spherical harmonics. From these basis filters, different types of orientation selective filters are synthesized. An interpolation scheme that provides a rotation as well as a translation of the synthesized filter is presented. The purpose is to obtain a robust and invariant feature extraction at a manageable computational cost.
@techreport{diva2:288342,
author = {Andersson, Mats T. and Knutsson, Hans},
title = {{Controllable 3-D Filters for Low Level Computer Vision}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1993},
type = {Other academic},
number = {LiTH-ISY-R, 1526},
address = {Sweden},
}
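The report's 3-D basis is related to spherical harmonics; as a much simpler 2-D analogue of the synthesis-by-interpolation idea (a Freeman-Adelson-style steerable pair, not the report's filter set), a first-order derivative-of-Gaussian filter at any orientation is an exact linear combination of two fixed basis kernels:

import numpy as np

y, x = np.mgrid[-8:9, -8:9]
G = np.exp(-(x**2 + y**2) / (2.0 * 4.0**2))
Gx, Gy = -x * G, -y * G                  # fixed basis kernels

def steered(theta):
    # "Steer" the oriented filter by interpolating between the basis kernels.
    return np.cos(theta) * Gx + np.sin(theta) * Gy

f45 = steered(np.pi / 4)                 # oriented kernel at 45 degrees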
@techreport{diva2:288587,
author = {Granlund, Gösta},
title = {{ESPRIT Project BRA 3038: Vision as Process, Final Report}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1993},
type = {Other academic},
number = {LiTH-ISY-R, 1473},
address = {Sweden},
}
The tensor representation has proven a successful tool as a means of describing local multi-dimensional orientation. In this respect, the tensor representation is a map from the local orientation to a second order tensor. This paper investigates how variations of the orientation are mapped to variations of the tensor, thereby giving an explicit equivariance relation. The results may be used to design tensor based algorithms for extraction of image features defined in terms of local variations of the orientation, e.g. multi-dimensional curvature or circular symmetries. It is assumed that the variation of the local orientation can be described in terms of an orthogonal transformation group. Under this assumption a corresponding orthogonal transformation group, acting on the tensor, is constructed. Several correspondences between the two groups are demonstrated.
@techreport{diva2:288623,
author = {Nordberg, Klas and Knutsson, Hans and Granlund, Gösta},
title = {{On the Equivariance of the Orientation and the Tensor Field Representation}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1993},
type = {Other academic},
number = {LiTH-ISY-R, 1530},
address = {Sweden},
}
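The simplest instance of the equivariance relation (the rank-one case; the report treats general orthogonal transformation groups) can be written out directly. If the tensor is built from a unit orientation vector, then

\[
T = \hat{x}\hat{x}^{\mathsf{T}}, \qquad \hat{x} \mapsto R\hat{x} \;\Longrightarrow\; T \mapsto (R\hat{x})(R\hat{x})^{\mathsf{T}} = R\,T\,R^{\mathsf{T}},
\]

so a rotation R acting on the orientation corresponds to the orthogonal transformation T -> R T R^T acting on the tensor, which is the kind of group correspondence the report constructs in general.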
@techreport{diva2:288577,
author = {Westin, Carl-Fredrik and Westelius, Carl-Johan},
title = {{ESPRIT Basic Research Action 7108, Vision as Process, DR.B.1: Integration of Low-level FOA \& Control Mechanisms}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1993},
type = {Other academic},
number = {},
address = {Sweden},
}
@techreport{diva2:288594,
author = {Larsen, Rasmus},
title = {{Thoughts on Bayesian Estimation of Motion Vector Fields}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1993},
type = {Other academic},
number = {LiTH-ISY-R, 1521},
address = {Sweden},
}
@techreport{diva2:288569,
author = {Granum, Erik and others},
title = {{ESPRIT Basic Research Action 7108, Vision as Process, Periodic progress report}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1993},
type = {Other academic},
number = {},
address = {Sweden},
}
@techreport{diva2:288290,
author = {Wiklund, Johan and Westin, Carl-Fredrik and Westelius, Carl-Johan},
title = {{AVS, Application Visualization System, Software Evaluation Report}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1993},
type = {Other academic},
number = {LiTH-ISY-R, 1469},
address = {Sweden},
}
@techreport{diva2:288563,
author = {Wilson, Roland and Knutsson, Hans},
title = {{Seeing Things I}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1993},
type = {Other academic},
number = {LiTH-ISY-R, 1467},
address = {Sweden},
}
The topic of this report is signal representation in the context of hierarchical image processing. An overview of hierarchical processing systems is included, as well as a presentation of various approaches to signal representation, feature representation and feature extraction. It is claimed that image hierarchies based on feature extraction, so-called feature hierarchies, demand a signal representation other than the standard spatial or linear representation used today. A new representation, the operator representation, is developed. It is based on an interpretation of features in terms of signal transformations. This representation has no reference to any spatial ordering of the signal elements and also gives an explicit representation of signal features. Using the operator representation, a generalization of the standard phase concept in image processing is introduced. Based on the operator representation, two algorithms for extraction of feature values are presented. Both have the capability of generating phase invariant feature descriptors. It is claimed that the operator representation, in conjunction with an appropriate feature extraction algorithm, is well suited as a general framework for defining multi-level feature hierarchies. The report contains an appendix with the mathematical details necessary to comprehend the presentation.
@techreport{diva2:288284,
author = {Nordberg, Klas},
title = {{Signal Representation and Signal Processing using Operators}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1992},
type = {Other academic},
number = {LiTH-ISY-I, 1387},
address = {Sweden},
}
This survey considers response generating systems that improve their behaviour using reinforcement learning. The difference between unsupervised learning, supervised learning, and reinforcement learning is described. Two general problems concerning learning systems are presented: the credit assignment problem and the problem of perceptual aliasing. Notation and some general issues concerning reinforcement learning systems are presented. Reinforcement learning systems are further divided into two main classes: memory mapping and projective mapping systems. Each of these classes is described and some examples are presented. Some other approaches are mentioned that do not fit into the two main classes. Finally, some issues not covered by the surveyed articles are discussed, and some comments on the subject are made.
@techreport{diva2:288303,
author = {Borga, Magnus and Carlsson, Tomas},
title = {{A Survey of Current Techniques for Reinforcement Learning}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1992},
type = {Other academic},
number = {LiTH-ISY-I, 1391},
address = {Sweden},
}
@techreport{diva2:288262,
author = {Westelius, Carl-Johan},
title = {{ESPRIT Basic Research Action 3038, Vision as Process, DS.A.2.1: Software for Model Support and Local FOA Control}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1992},
type = {Other academic},
number = {},
address = {Sweden},
}
@techreport{diva2:288294,
author = {Westin, Carl-Fredrik},
title = {{ESPRIT Basic Research Action 3038, Vision as Process, DR.A.2.1: Model Support and Local FOA Control}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1992},
type = {Other academic},
number = {},
address = {Sweden},
}
@techreport{diva2:288264,
author = {Westelius, Carl-Johan and Knutsson, Hans and Wiklund, Johan},
title = {{Robust Vergence Control Using Scale--Space Phase Information}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1992},
type = {Other academic},
number = {LiTH-ISY-I, 1363},
address = {Sweden},
}
@techreport{diva2:288561,
author = {Bårman, Håkan and Knutsson, Hans and Granlund, Gösta H.},
title = {{A Note on Estimation of Optical Flow and Acceleration}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1992},
type = {Other academic},
number = {LiTH-ISY-I, 1313},
address = {Sweden},
}
@techreport{diva2:288339,
author = {Wiklund, Johan and Westelius, Carl-Johan and Knutsson, Hans},
title = {{Hierarchical Phase Based Disparity Estimation}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1992},
type = {Other academic},
number = {LiTH-ISY-I, 1327},
address = {Sweden},
}
@techreport{diva2:288624,
author = {Bårman, Håkan and Granlund, Gösta},
title = {{Hierarchical Feature Extraction for Computer-Aided Analysis of Mammograms}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1992},
type = {Other academic},
number = {LiTH-ISY-R, 1448},
address = {Sweden},
}
@techreport{diva2:288589,
author = {Wiklund, Johan and Knutsson, Hans and Wilson, Roland},
title = {{A Hierarchical Stereo Algorithm}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1991},
type = {Other academic},
number = {LiTH-ISY-I, 1167},
address = {Sweden},
}
@techreport{diva2:288298,
author = {Westelius, Carl-Johan and Knutsson, Hans},
title = {{ESPRIT Basic Research Action 3038, Vision as Process, DS.A.1.1: Preliminary Software for Feature Extraction}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1991},
type = {Other academic},
number = {},
address = {Sweden},
}
The problem of incorporating orientation selectivity into transforms which provide local frequency representation of image regions over a range of spatial scales is investigated. It is shown that this can be achieved if the local spectra are defined on a log-polar coordinate lattice and that by appropriate choice of window functions, the spectra can be designed to be steerable in arbitrary orientations. In addition, the resulting class of transforms can be defined to be invertible, be based on window functions having good localization in both the spatial and spatial frequency domains, and be efficiently implemented using FFT techniques. Results of using one such transform for linear feature extraction demonstrate its effectiveness when dealing with oriented features.
@techreport{diva2:288269,
author = {Calway, Andrew},
title = {{Incorporating Orientation Selectivity in Wavelet Transforms: For Multi--Resolution Fourier Analysis of Images}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1991},
type = {Other academic},
number = {LiTH-ISY-I, 1243},
address = {Sweden},
}
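A minimal sketch of the coordinate change underlying the abstract above (nearest-neighbour sampling, Hanning window, all parameters illustrative): resampling a windowed local spectrum onto a log-polar lattice turns rotation of the patch into a shift along the angle axis and scaling into a shift along the log-radius axis, which is what makes orientation steering natural in these transforms.

import numpy as np

def logpolar_spectrum(patch, n_r=16, n_theta=32):
    # Windowed local spectrum, resampled onto a log-polar lattice.
    w = np.hanning(patch.shape[0])
    F = np.fft.fftshift(np.abs(np.fft.fft2(patch * np.outer(w, w))))
    c = patch.shape[0] // 2
    r = np.exp(np.linspace(0.0, np.log(c - 1), n_r))   # log-radius samples
    th = np.linspace(0.0, 2 * np.pi, n_theta, endpoint=False)
    i = np.clip((c + r[:, None] * np.sin(th)).astype(int), 0, 2 * c - 1)
    j = np.clip((c + r[:, None] * np.cos(th)).astype(int), 0, 2 * c - 1)
    return F[i, j]                                     # (n_r, n_theta)

patch = np.random.default_rng(3).standard_normal((64, 64))
S = logpolar_spectrum(patch)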
@techreport{diva2:288292,
author = {Wilson, Roland and Calway, Andrew and Pearson, Edward R. S.},
title = {{A generalised wavelet transform for Fourier analysis: The multiresolution Fourier transform and its application to image and audio signal analysis}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1991},
type = {Other academic},
number = {LiTH-ISY-I, 1177},
address = {Sweden},
}
@techreport{diva2:288547,
author = {Bårman, Håkan and Knutsson, Hans and Granlund, Gösta H.},
title = {{Using Principal Direction Estimates for Shape and Acceleration Description}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1991},
type = {Other academic},
number = {LiTH-ISY-I, 1231},
address = {Sweden},
}
@techreport{diva2:288341,
author = {Westin, Carl-Fredrik and Knutsson, Hans},
title = {{Line Segmentation by Clustering in Möbius-Hough Space}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1991},
type = {Other academic},
number = {LiTH-ISY-I, 1221},
address = {Sweden},
}
@techreport{diva2:288333,
author = {Westelius, Carl-Johan and Granlund, Gösta},
title = {{Integrated Analysis-Control Structure for Robotic Systems}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1991},
type = {Other academic},
number = {},
address = {Sweden},
}
@techreport{diva2:288626,
author = {Westin, Carl-Fredrik and Knutsson, Hans},
title = {{ESPRIT Basic Research Action 3038, Vision as Process, DR.A.1.2: Definition of feature generating procedures}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1991},
type = {Other academic},
number = {},
address = {Sweden},
}
@techreport{diva2:288293,
author = {Bårman, Håkan and Granlund, Gösta H. and Knutsson, Hans},
title = {{Hierarchical Curvature Estimation and Description}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1990},
type = {Other academic},
number = {LiTH-ISY-I, 1095},
address = {Sweden},
}
@techreport{diva2:288325,
author = {Westelius, Carl-Johan and Knutsson, Hans and Granlund, Gösta H.},
title = {{Focus of Attention Control}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1990},
type = {Other academic},
number = {LiTH-ISY-I, 1140},
address = {Sweden},
}
@techreport{diva2:288319,
author = {Westin, Carl-Fredrik and Knutsson, Hans},
title = {{A Parameter Mapping for Line Segmentation}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1990},
type = {Other academic},
number = {LiTH-ISY-I, 1151},
address = {Sweden},
}
@techreport{diva2:288313,
author = {Järvinen, Arto and Wiklund, Johan},
title = {{Study of information mapping in Kohonen--Networks}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1989},
type = {Other academic},
number = {LiTH-ISY-I, 0978},
address = {Sweden},
}
@techreport{diva2:288328,
author = {Granlund, Gösta H.},
title = {{Image Processing Systems and Components}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1989},
type = {Other academic},
number = {LiTH-ISY-I, 1016},
address = {Sweden},
}
@techreport{diva2:288296,
author = {Granlund, Gösta H.},
title = {{Discriminant Functions, Linear Operations and Learning}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1989},
type = {Other academic},
number = {LiTH-ISY-I, 1015},
address = {Sweden},
}
@techreport{diva2:288321,
author = {Granlund, Gösta H.},
title = {{Information Representation in Image Analysis Algorithms}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1989},
type = {Other academic},
number = {LiTH-ISY-I, 1017},
address = {Sweden},
}
@techreport{diva2:288609,
author = {Bårman, Håkan and Knutsson, Hans and Granlund, Gösta H.},
title = {{Mechanisms for Striate Cortex Organization}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1989},
type = {Other academic},
number = {LiTH-ISY-I, 1020},
address = {Sweden},
}
@techreport{diva2:288606,
author = {Westin, Carl-Fredrik and Westelius, Carl-Johan},
title = {{Brain chaos. A feature or a bug?}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1989},
type = {Other academic},
number = {LiTH-ISY-I, 0990},
address = {Sweden},
}
This report is a survey of information representations in both biological and artificial neural networks. The correct information representation is crucial for the dynamics and the adaptation algorithms of neural networks. A number of examples of existing information representations are given.
@techreport{diva2:288541,
author = {Järvinen, Arto},
title = {{Information representation in neural networks -- A survey}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1989},
type = {Other academic},
number = {LiTH-ISY-I, 0994},
address = {Sweden},
}
@techreport{diva2:288265,
author = {Granlund, Gösta H.},
title = {{Integrated Analysis-Response Structures for Robotics Systems}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1988},
type = {Other academic},
number = {LiTH-ISY-I, 0932},
address = {Sweden},
}
@techreport{diva2:288287,
author = {Granlund, Gösta H.},
title = {{Bi-Directionally Adaptive Models in Image Analysis}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1988},
type = {Other academic},
number = {LiTH-ISY-I, 0930},
address = {Sweden},
}
@techreport{diva2:288334,
author = {Andersson, Mats and Granlund, Gösta H.},
title = {{A Hybrid Image Processing Architecture}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1988},
type = {Other academic},
number = {LiTH-ISY-I, 0929},
address = {Sweden},
}
@techreport{diva2:288338,
author = {Granlund, Gösta H. and Knutsson, Hans},
title = {{Compact Associative Representation of Structural Information}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1988},
type = {Other academic},
number = {LiTH-ISY-I, 0931},
address = {Sweden},
}
@techreport{diva2:288640,
author = {Granlund, Gösta H.},
title = {{Magnitude Representation of Feature Variables}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1988},
type = {Other academic},
number = {LiTH-ISY-I, 0933},
address = {Sweden},
}
@techreport{diva2:288600,
author = {Bårman, Håkan and Haglund, Leif and Granlund, Gösta H.},
title = {{Context Dependent Hierarchical Image Processing for Remote Sensing Data, Part Two: Contextual Classification and Segmentation}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1988},
type = {Other academic},
number = {LiTH-ISY-I, 0924},
address = {Sweden},
}
@techreport{diva2:288336,
author = {Bigun, Josef},
title = {{Impressions from Picture Processing in USA and Japan}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1988},
type = {Other academic},
number = {LiTH-ISY-I, 0892},
address = {Sweden},
}
The symmetries in a neighbourhood of a gray value image are modelled by conjugate harmonic function pairs. A harmonic function pair is utilized to represent a coordinate transformation defining a symmetry type. In this coordinate representation the image parts which are symmetric with respect to the chosen function pair have iso-gray value curves that are simple lines or parallel line patterns. The detection is modelled in the special Fourier domain corresponding to the new variables by minimizing an error function. It is shown that the minimization process, or detection of these patterns, can be carried out for the whole image entirely in the spatial domain by convolutions. The convolution kernel is complex valued, as is the result. The magnitude of the result is shown to correspond to a well-defined certainty measure, while the orientation is the least-squares estimate of an orientation in the Fourier transform corresponding to the harmonic coordinates. Applications to four symmetries are given: circular, linear, hyperbolic and parabolic symmetries. Experimental results are presented.
@techreport{diva2:288323,
author = {Bigun, Josef},
title = {{Detection of Linear Symmetry in Multiple Dimensions for Description of Local Orientation and Optical Flow}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1988},
type = {Other academic},
number = {LiTH-ISY-I, 893},
address = {Sweden},
}
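For the linear-symmetry case, the spatial-domain convolution scheme described above reduces to averaging the squared complex gradient; a minimal sketch (box window instead of an optimized kernel): consistent orientations reinforce each other in the complex sum, inconsistent ones cancel, so the argument gives the (double-angle) least-squares orientation and the magnitude a certainty measure.

import numpy as np
from scipy.ndimage import uniform_filter

def linear_symmetry(img, size=9):
    gy, gx = np.gradient(img.astype(float))
    z = (gx + 1j * gy) ** 2            # squared complex gradient (double angle)
    # Complex-valued local averaging = the convolution step.
    zf = uniform_filter(z.real, size) + 1j * uniform_filter(z.imag, size)
    orientation = 0.5 * np.angle(zf)   # least-squares orientation estimate
    certainty = np.abs(zf)             # high where iso-gray curves are parallel
    return orientation, certainty

y, x = np.mgrid[0:64, 0:64]
ori, cert = linear_symmetry(np.sin(0.4 * (x + 2 * y)))   # oriented sine pattern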
@techreport{diva2:288607,
author = {Albregtsen, Fritz},
title = {{Enhancing Satellite Images of the Antarctic Snow and Ice Cover by Context Dependent Anisotropic Nonstationary Filtering.}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1987},
type = {Other academic},
number = {LiTH-ISY-I, 0852},
address = {Sweden},
}
@techreport{diva2:288274,
author = {Bigun, Josef},
title = {{Optimal Orientation Detection of Circular Symmetry.}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1987},
type = {Other academic},
number = {LiTH-ISY-I, 0871},
address = {Sweden},
}
The problem of optimal detection of orientation in arbitrary neighborhoods is solved in the least squares sense. It is shown that this corresponds to fitting an axis in the Fourier domain of the n-dimensional neighborhood, which is a well-known matrix eigenvalue problem. The eigenvalues are the variance, or inertia, with respect to the axes given by their respective eigenvectors. The orientation is taken as the axis given by the least eigenvalue. Moreover, it is shown that the necessary computations can be pursued in the spatial domain without doing a Fourier transformation. An implementation for 2-D is presented. Two certainty measures are given corresponding to the orientation estimate. These are the relative and the absolute distances between the two eigenvalues, revealing whether the fitted axis is much better than an axis orthogonal to it. The result of the implementation is verified by experiments which confirm accurate orientation estimation and a reliable certainty measure in the presence of additive noise at high as well as low levels.
@techreport{diva2:691493,
author = {Bigun, Josef},
title = {{Optimal Orientation Detection of Linear Symmetry}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1987},
type = {Other academic},
number = {LiTH-ISY-I, 828},
address = {Sweden},
}
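A common 2-D realization of the scheme in the abstract above, sketched under the assumption of a Gaussian averaging window: build the gradient scatter (inertia) matrix in the spatial domain, solve the 2x2 eigenvalue problem in closed form, and use the relative eigenvalue distance as the certainty measure.

import numpy as np
from scipy.ndimage import gaussian_filter

def orientation_2d(img, sigma=3.0):
    gy, gx = np.gradient(img.astype(float))
    # Local inertia matrix of the gradient; no Fourier transform needed.
    Jxx = gaussian_filter(gx * gx, sigma)
    Jyy = gaussian_filter(gy * gy, sigma)
    Jxy = gaussian_filter(gx * gy, sigma)
    # Closed-form eigenvalues of [[Jxx, Jxy], [Jxy, Jyy]].
    diff = Jxx - Jyy
    lam1 = 0.5 * (Jxx + Jyy + np.hypot(diff, 2 * Jxy))
    lam2 = 0.5 * (Jxx + Jyy - np.hypot(diff, 2 * Jxy))
    # theta is the dominant gradient direction; the fitted axis (the least-
    # eigenvalue direction in the abstract's terms) is perpendicular to it.
    theta = 0.5 * np.arctan2(2 * Jxy, diff)
    certainty = (lam1 - lam2) / (lam1 + lam2 + 1e-12)   # relative distance
    return theta, certainty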
@techreport{diva2:288554,
author = {Granlund, Gösta H.},
title = {{Introduction to GOP Computer Vision.}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1986},
type = {Other academic},
number = {LiTH-ISY-I, 0849},
address = {Sweden},
}
@techreport{diva2:288617,
author = {Bårman, Håkan and Granlund, Gösta H. and Knutsson, Hans and Näppä, L.},
title = {{Context Dependent Hierarchical Image Processing for Remote Sensing Data.}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1986},
type = {Other academic},
number = {LiTH-ISY-I, 0824},
address = {Sweden},
}
A definition of central symmetry for local neighborhoods of 2-D images is given. A complete ON-set of centrally symmetric basis functions is proposed. The local neighborhoods are expanded in this basis. The behavior of the coefficient spectrum obtained by this expansion is proposed as the foundation of central symmetry parameters of the neighborhoods. Specifically, examination of two such behaviors is proposed: point concentration and line concentration of the energy spectrum. Moreover, the study of these types of spectral behavior is shown to be possible in the spatial domain.
@techreport{diva2:691498,
author = {Bigun, Josef and Granlund, Gösta H.},
title = {{Central Symmetry Modelling}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1986},
type = {Other academic},
number = {LiTH-ISY-I, 789},
address = {Sweden},
}
@techreport{diva2:288310,
author = {Näppä, Lars and Granlund, Gösta H.},
title = {{Texture Analysis and Description.}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1985},
type = {Other academic},
number = {LiTH-ISY-I, 0775},
address = {Sweden},
}
@techreport{diva2:403796,
author = {Granlund, Gösta},
title = {{Images and Computers}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1984},
type = {Other academic},
number = {LiTH-ISY-I, 0701},
address = {Sweden},
}
@techreport{diva2:288302,
author = {Wilson, Roland},
title = {{The Uncertainty Principle in Image Coding}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1983},
type = {Other academic},
number = {LiTH-ISY-I, 0579},
address = {Sweden},
}
@techreport{diva2:403798,
author = {Wilson, Roland},
title = {{A Class of Local Centroid Algorithms for Classification and Quantization in Spaces of Arbitrary Dimension}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1983},
type = {Other academic},
number = {LiTH-ISY-I, 0610},
address = {Sweden},
}
@techreport{diva2:403809,
author = {Wilson, Roland and Granlund, Gösta},
title = {{The Uncertainty Principle in Image Processing}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1983},
type = {Other academic},
number = {LiTH-ISY-I, 0576},
address = {Sweden},
}
@techreport{diva2:403800,
author = {Wilson, Roland},
title = {{Quad-Tree Predictive Coding:
A New Class of Image Data Compression Algorithms}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1983},
type = {Other academic},
number = {LiTH-ISY-I, 0609},
address = {Sweden},
}
@techreport{diva2:403805,
author = {Wilson, Roland},
title = {{Uncertainty, Eigenvalue Problems and Filter Design}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1983},
type = {Other academic},
number = {LiTH-ISY-I, 0580},
address = {Sweden},
}
@techreport{diva2:403801,
author = {Wilson, Roland},
title = {{The Uncertainty Principle in Vision}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1983},
type = {Other academic},
number = {LiTH-ISY-I, 0581},
address = {Sweden},
}
@techreport{diva2:288540,
author = {Granlund, Gösta H.},
title = {{Hierarchical Distributed Data Structures and Operations}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1982},
type = {Other academic},
number = {LiTH-ISY-I, 0512},
address = {Sweden},
}
Operators for extraction of local information are essential components in an image processing system. This paper concentrates on the design and evaluation of convolution kernel sets enabling easy estimation of local orientation and frequency.
Consideration of interpolation properties and the limiting effects of the uncertainty principle leads to the definition of an "ideal" quadrature filter function. An optimization procedure is utilized to produce pairs of convolution kernels which implement an approximation of the desired function. A number of optimization results are presented.
To evaluate the performance of the optimized kernels in an image processing task, a series of experiments has been carried out. Examples are given of local orientation and frequency estimates for images with different signal-to-noise ratios. An angle deviation measure is defined and a vector averaging scheme is introduced to increase angle estimation accuracy. Using a 0 dB SNR test image, orientation estimates are produced having an expected deviation of less than 7 degrees.
@techreport{diva2:319074,
author = {Knutsson, Hans},
title = {{Design of Convolution Kernels}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1982},
type = {Other academic},
number = {LiTH-ISY-I, 0557},
address = {Sweden},
}
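One standard construction of the kind of quadrature filter the report optimizes kernels for (parameters illustrative): a lognormal radial band-pass multiplied by a cos^2 angular function supported on a single half-plane of the frequency domain, so that the magnitude of the complex response is phase invariant, responding equally to lines and edges.

import numpy as np

def quadrature_filter(n=64, u0=np.pi / 4, B=2.0, direction=(1.0, 0.0)):
    f = np.fft.fftfreq(n) * 2.0 * np.pi
    fy, fx = np.meshgrid(f, f, indexing='ij')
    rho = np.hypot(fx, fy)
    # Lognormal radial function with centre frequency u0 and bandwidth B.
    R = np.zeros_like(rho)
    nz = rho > 0
    R[nz] = np.exp(-(4.0 / np.log(2.0)) * np.log(rho[nz] / u0) ** 2 / B ** 2)
    # cos^2 angular function, zero on the half-plane opposite `direction`.
    dx, dy = direction
    proj = (fx * dx + fy * dy) / np.maximum(rho, 1e-12)
    D = np.where(proj > 0, proj ** 2, 0.0)
    return R * D

img = np.random.default_rng(2).standard_normal((64, 64))
resp = np.fft.ifft2(np.fft.fft2(img) * quadrature_filter())
energy = np.abs(resp)      # phase-invariant local energy in the chosen band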
@techreport{diva2:288571,
author = {Granlund, Gösta H. and Knutsson, Hans and Hedlund, Martin},
title = {{Hierarchical Processing of Structural Information}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1981},
type = {Other academic},
number = {LiTH-ISY-I, 0481},
address = {Sweden},
}
@techreport{diva2:288309,
author = {Kunt, Murat},
title = {{Picture Coding with the General Operator Processor (GOP)}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1980},
type = {Other academic},
number = {LiTH-ISY-I, 0370},
address = {Sweden},
}
@techreport{diva2:288306,
author = {Knutsson, Hans},
title = {{3-D Reconstruction by Fourier Techniques with Error Estimates}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1978},
type = {Other academic},
number = {LiTH-ISY-I, 0214},
address = {Sweden},
}
@techreport{diva2:288337,
author = {Granlund, Gösta H.},
title = {{Computer Processing and Display of Chromosome Image Information}},
institution = {Linköping University, Department of Electrical Engineering},
year = {1973},
type = {Other academic},
number = {LiTH-ISY-I, 0023},
address = {Sweden},
}
Last updated: 2015-02-25