Browsing by Author "Gutierrez, Diego"
Now showing 1 - 15 of 15
Item Coded Apertures for Defocus Deblurring (The Eurographics Association, 2021) Masia, Belen; Corrales, Adrian; Presa, Lara; Gutierrez, Diego; Silva, F. and Gutierrez, D. and Rodríguez, J. and Figueiredo, M.
The field of computational photography, and in particular the design and implementation of coded apertures, has yielded impressive results in recent years. Among their applications lies defocus deblurring, on which we focus in this paper. Following the approach of previous works, we obtain near-optimal coded apertures using a genetic algorithm and an existing quality metric. We perform both synthetic and real experiments, testing the performance of the apertures along the dimensions of depth, size and shape. We additionally explore non-binary apertures, usually overlooked in the literature, and perform a comparative analysis with their binary counterparts.

Item Convolutional Sparse Coding for Capturing High‐Speed Video Content (© 2017 The Eurographics Association and John Wiley & Sons Ltd., 2017) Serrano, Ana; Garces, Elena; Masia, Belen; Gutierrez, Diego; Chen, Min and Zhang, Hao (Richard)
Video capture is limited by the trade‐off between spatial and temporal resolution: when capturing videos of high temporal resolution, the spatial resolution decreases due to bandwidth limitations in the capture system. Achieving both high spatial and temporal resolution is only possible with highly specialized and very expensive hardware, and even then the same basic trade‐off remains. The recent introduction of compressive sensing and sparse reconstruction techniques allows for the capture of high‐speed video, by coding the temporal information in a single frame, and then reconstructing the full video sequence from this single coded image and a trained dictionary of image patches. In this paper, we first analyse this approach, and find insights that help improve the quality of the reconstructed videos. We then introduce a novel technique, based on convolutional sparse coding (CSC), and show how it outperforms the state‐of‐the‐art, patch‐based approach in terms of flexibility and efficiency, due to the convolutional nature of its filter banks. The key idea for CSC high‐speed video acquisition is extending the basic formulation by imposing an additional constraint in the temporal dimension, which enforces sparsity of the first‐order derivatives over time.
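A minimal sketch of the coded-capture model and objective behind the CSC item above, assuming a per-pixel binary shutter code and, for brevity, an identity dictionary (the paper learns convolutional filter banks; `lam`, `mu` and all array shapes here are illustrative assumptions, not the authors' implementation):

```python
import numpy as np

# Toy forward model: a high-speed sequence is compressed into one coded
# exposure; reconstruction would minimize a data term plus two sparsity
# terms, including the temporal first-order-derivative term named above.
rng = np.random.default_rng(0)
T, H, W = 8, 32, 32                                   # frames, height, width
video = rng.random((T, H, W))                         # ground-truth frames
codes = rng.integers(0, 2, (T, H, W)).astype(float)   # per-pixel shutter code
coded_image = (codes * video).sum(axis=0)             # single coded exposure

def objective(frames, lam=0.1, mu=0.1):
    """Data fidelity + sparsity + temporal-gradient sparsity (all toy)."""
    data = 0.5 * np.sum((coded_image - (codes * frames).sum(axis=0)) ** 2)
    sparsity = lam * np.abs(frames).sum()                  # stands in for ||z||_1
    temporal = mu * np.abs(np.diff(frames, axis=0)).sum()  # ||dz/dt||_1
    return data + sparsity + temporal

print(objective(np.zeros_like(video)))   # cost of the all-black guess
```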
Item Efficient Propagation of Light Field Edits (The Eurographics Association, 2021) Jarabo, Adrian; Masia, Belen; Gutierrez, Diego; Silva, F. and Gutierrez, D. and Rodríguez, J. and Figueiredo, M.
Light field editing is a complex task, due to the large amount of data and the need to keep consistency between views. This has hampered the creation of efficient edit propagation methods, similar to those existing for single images. We propose a framework to edit light fields at interactive rates, by propagating sparse user edits to the full light field. This propagation is guided by a novel affinity function, which forces similar pixels (as defined in our affinity space) to receive similar edits, thus ensuring consistency. To manage the light field's large amount of data, we propose a novel multi-dimensional downsampling technique: we first cluster pixels with high affinity, and then perform edit propagation over the downsampled data. We finally upsample back to the original full resolution, maintaining visual fidelity and consistency between views.
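The propagation step described above can be pictured as a weighted scatter of the user's sparse edits, with weights given by an affinity over a per-pixel feature vector. A minimal sketch, assuming a Gaussian affinity and a made-up 7D descriptor (position, angular coordinates, color); the paper's actual affinity space and downsampling scheme differ:

```python
import numpy as np

def propagate_edits(features, edited_feats, edit_vals, sigma=0.2):
    """features: (N, D) per-pixel descriptors, e.g. (x, y, u, v, L, a, b).
    edited_feats: (M, D) descriptors of the user-edited pixels.
    edit_vals: (M,) edit strengths to spread over the light field."""
    d2 = ((features[:, None, :] - edited_feats[None, :, :]) ** 2).sum(-1)
    w = np.exp(-d2 / (2.0 * sigma ** 2))        # affinity to each edit
    w /= w.sum(axis=1, keepdims=True) + 1e-12   # normalize per pixel
    return w @ edit_vals                        # propagated edit per pixel

feats = np.random.rand(1000, 7)                 # toy light-field descriptors
out = propagate_edits(feats, feats[:3], np.array([1.0, 0.5, 0.0]))
```

Clustering high-affinity pixels first, as the item describes, simply replaces `features` with cluster centroids, shrinking N before this quadratic-cost step.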
Item EUROGRAPHICS 2017: Tutorials Frontmatter (Eurographics Association, 2017) Bousseau, Adrien; Gutierrez, Diego

Item EUROGRAPHICS 2018: CGF 37-2 Frontmatter (The Eurographics Association and John Wiley & Sons Ltd., 2018) Gutierrez, Diego; Sheffer, Alla

Item Exploring the Impact of 360º Movie Cuts in Users' Attention (The Eurographics Association, 2020) Marañes, Carlos; Gutierrez, Diego; Serrano, Ana; Christie, Marc and Wu, Hui-Yin and Li, Tsai-Yen and Gandhi, Vineet
Virtual Reality (VR) has become more relevant since the first devices for personal use became available on the market. New content has emerged for this new medium with different purposes such as education, training, entertainment, etc. However, the production workflow of cinematic VR content is still experimental, largely because content creators disagree on how to tell a story effectively in this medium. Unlike traditional filmmaking, which has been refined for more than 100 years, movie editing in VR brings new challenges: viewers now have partial control of the camera and can watch every degree of the 360º that surrounds them, possibly missing aspects of the scene that are key to understanding the narrative. Directors can still decide how to edit the film by combining different shots; nevertheless, viewers' behavior may be influenced by the scenes before and after each cut. To address this issue, we analyze users' behavior across cuts in a professional movie, where the narrative plays an important role, and derive new insights that could potentially influence VR content creation, informing content creators about the impact of different cuts on viewers' behavior.

Item Generic Interactive Pixel-level Image Editing (The Eurographics Association and John Wiley & Sons Ltd., 2019) Liang, Yun; Gan, Yibo; Chen, Mingqin; Gutierrez, Diego; Muñoz Orbañanos, Adolfo; Lee, Jehee and Theobalt, Christian and Wetzstein, Gordon
Several image editing methods have been proposed in the past decades, achieving brilliant results. The most sophisticated of them, however, require additional information per pixel. For instance, dehazing requires a specific transmittance value per pixel, and depth-of-field blurring requires depth or disparity values per pixel. This additional per-pixel value is obtained either through elaborate heuristics or through additional control over the capture hardware, very often tailored to the specific editing application. In contrast, we propose a generic editing paradigm that can become the base of several different applications. This paradigm generates both the needed per-pixel values and the resulting edit at interactive rates, with minimal user input that can be iteratively refined. Our key insight for obtaining per-pixel values at such speed is to cluster them into superpixels, but, instead of a constant value per superpixel (which yields accuracy problems), we use a mathematical expression for pixel values at each superpixel: in our case, an order-two multinomial per superpixel. This leads to a linear least-squares system, effectively enabling specific per-pixel values at fast speeds. We illustrate this approach in three applications: depth-of-field blurring (from depth values), dehazing (from transmittance values) and tone mapping (from local brightness and contrast values), and our approach proves both interactive and accurate in all three. Our technique is also evaluated on a common dataset and compares favorably.
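The core of the paradigm above is small enough to sketch: inside each superpixel, fit an order-two multinomial to whatever sparse per-pixel values are available, which is an ordinary linear least-squares problem. A hedged sketch (sample data and function names are invented for illustration):

```python
import numpy as np

def fit_superpixel(xs, ys, vals):
    """Least-squares fit of v = a + b*x + c*y + d*x^2 + e*x*y + f*y^2."""
    A = np.stack([np.ones_like(xs), xs, ys, xs**2, xs*ys, ys**2], axis=1)
    coef, *_ = np.linalg.lstsq(A, vals, rcond=None)
    return coef

def eval_superpixel(coef, xs, ys):
    """Evaluate the fitted multinomial at arbitrary pixel coordinates."""
    A = np.stack([np.ones_like(xs), xs, ys, xs**2, xs*ys, ys**2], axis=1)
    return A @ coef

# Toy example: recover a smooth depth field inside one superpixel from
# ten sparse samples, then evaluate it densely.
xs, ys = np.random.rand(10), np.random.rand(10)
coef = fit_superpixel(xs, ys, 0.5 + 0.3 * xs - 0.2 * ys**2)
dense = eval_superpixel(coef, *np.random.rand(2, 100))
```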
Item Graph-Based Reflectance Segmentation (The Eurographics Association, 2021) Garces, Elena; Gutierrez, Diego; Lopez-Moreno, Jorge; Silva, F. and Gutierrez, D. and Rodríguez, J. and Figueiredo, M.
Most unsupervised image segmentation algorithms use just RGB color information to establish the similarity criteria between pixels in the image. In many cases this leads to a wrong interpretation of the scene, since these criteria consider neither the physical interactions which give rise to those RGB values (illumination, geometry, albedo) nor our perception of the scene. In this paper, we propose a novel criterion for unsupervised image segmentation which not only relies on color features, but also takes into account an approximation of the material reflectance. Using a perceptually uniform color space, we apply our criterion to one of the most relevant state-of-the-art segmentation techniques, showing its suitability for segmenting images into small and coherent clusters of constant reflectance. Furthermore, given the wide adoption of this algorithm, we provide for the first time in the literature an evaluation of the technique under several scenarios and different configurations of its parameters. Finally, in order to enhance both the accuracy of the segmentation and the inner coherence of the clusters, we apply a series of image processing filters to the input image (median, mean-shift, bilateral), analyzing their effects on the segmentation process. Our results can be transferred to any image segmentation algorithm.

Item Intrinsic Light Field Images (© 2017 The Eurographics Association and John Wiley & Sons Ltd., 2017) Garces, Elena; Echevarria, Jose I.; Zhang, Wen; Wu, Hongzhi; Zhou, Kun; Gutierrez, Diego; Chen, Min and Zhang, Hao (Richard)
We present a method to automatically decompose a light field into its intrinsic shading and albedo components. Contrary to previous work targeted at two‐dimensional (2D) single images and videos, a light field is a 4D structure that captures non‐integrated incoming radiance over a discrete angular domain. This higher dimensionality of the problem renders previous state‐of‐the‐art algorithms impractical, either due to their cost of processing a single 2D slice, or their inability to enforce proper coherence in the additional dimensions. We propose a new decomposition algorithm that jointly optimizes the whole light field data for proper angular coherence. For efficiency, we extend Retinex theory, working in the gradient domain, where new albedo and occlusion terms are introduced. Results show that our method provides 4D intrinsic decompositions difficult to achieve with previous state‐of‐the‐art algorithms. We further provide a comprehensive analysis and comparisons with existing intrinsic image/video decomposition methods on light field images.

Item Mobile Computational Photography: Exposure Fusion on the Nokia N900 (The Eurographics Association, 2021) Echevarria, Jose I.; Gutierrez, Diego; Silva, F. and Gutierrez, D. and Rodríguez, J. and Figueiredo, M.
This paper explores the Nokia N900 platform as a versatile playground for mobile computational photography. We present the implementation of a simple yet practical application to acquire high-quality tone-mapped HDR images, along with a relatively straightforward white balance selection. We have built everything from scratch: from the capture of the bracketed sequence of images with different exposures, to the final tone mapping algorithm. Completed in less than one week, showcasing the flexibility and productivity of the platform, our experiments have encouraged us to continue exploring all the possibilities this kind of device offers.
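Exposure fusion of the kind built on the N900 can be sketched in a few lines. This follows a Mertens-style per-pixel weighting (contrast, saturation, well-exposedness) but skips the multiresolution blending a production implementation would add; the weight formulas and sigma are simplified assumptions, not the paper's code:

```python
import numpy as np

def exposure_fusion(stack):
    """stack: (K, H, W, 3) bracketed exposures, values in [0, 1]."""
    gray = stack.mean(axis=3)
    contrast = (np.abs(np.gradient(gray, axis=1))
                + np.abs(np.gradient(gray, axis=2)))      # detail measure
    saturation = stack.std(axis=3)                        # color spread
    well_exposed = np.exp(-((gray - 0.5) ** 2) / (2 * 0.2 ** 2))
    w = contrast * saturation * well_exposed + 1e-12
    w /= w.sum(axis=0, keepdims=True)     # weights sum to 1 per pixel
    return (w[..., None] * stack).sum(axis=0)             # blended result

fused = exposure_fusion(np.random.rand(3, 64, 64, 3))     # toy 3-shot stack
```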
Item Modeling Surround-aware Contrast Sensitivity (The Eurographics Association, 2021) Yi, Shinyoung; Jeon, Daniel S.; Serrano, Ana; Jeong, Se-Yoon; Kim, Hui-Yong; Gutierrez, Diego; Kim, Min H.; Bousseau, Adrien and McGuire, Morgan
Despite advances in display technology, many existing applications rely on psychophysical datasets of human perception gathered using older, sometimes outdated displays. As a result, there exists the underlying assumption that such measurements can be carried over to the new viewing conditions of more modern technology. We have conducted a series of psychophysical experiments to explore contrast sensitivity using a state-of-the-art HDR display, taking into account not only the spatial frequency and luminance of the stimuli but also their surrounding luminance levels. From our data, we have derived a novel surround-aware contrast sensitivity function (CSF), which predicts human contrast sensitivity more accurately. We additionally provide a practical version that retains the benefits of our full model, while enabling easy backward compatibility and consistently producing good results across many existing applications that make use of CSF models. We show examples of effective HDR video compression using a transfer function derived from our CSF, tone mapping, and improved accuracy in visual difference prediction.

Item Modelling Surround‐aware Contrast Sensitivity for HDR Displays (© 2022 Eurographics ‐ The European Association for Computer Graphics and John Wiley & Sons Ltd, 2022) Yi, Shinyoung; Jeon, Daniel S.; Serrano, Ana; Jeong, Se‐Yoon; Kim, Hui‐Yong; Gutierrez, Diego; Kim, Min H.; Hauser, Helwig and Alliez, Pierre
Despite advances in display technology, many existing applications rely on psychophysical datasets of human perception gathered using older, sometimes outdated displays. As a result, there exists the underlying assumption that such measurements can be carried over to the new viewing conditions of more modern technology. We have conducted a series of psychophysical experiments to explore contrast sensitivity using a state‐of‐the‐art HDR display, taking into account not only the spatial frequency and luminance of the stimuli but also their surrounding luminance levels. From our data, we have derived a novel surround‐aware contrast sensitivity function (CSF), which predicts human contrast sensitivity more accurately. We additionally provide a practical version that retains the benefits of our full model, while enabling easy backward compatibility and consistently producing good results across many existing applications that make use of CSF models. We show examples of effective HDR video compression using a transfer function derived from our CSF, tone mapping, and improved accuracy in visual difference prediction.

Item Progressive Transient Photon Beams (© 2019 Eurographics ‐ The European Association for Computer Graphics and John Wiley & Sons Ltd, 2019) Marco, Julio; Guillén, Ibón; Jarosz, Wojciech; Gutierrez, Diego; Jarabo, Adrian; Chen, Min and Benes, Bedrich
In this work, we introduce a novel algorithm for transient rendering in participating media. Our method is consistent, robust, and able to generate animations of time‐resolved light transport featuring complex caustic light paths in media. We base our method on the observation that spatial continuity provides increased coverage of the temporal domain, and generalize photon beams to transient state. We extend steady‐state photon beam radiance estimates to include the temporal domain. Then, we develop a progressive variant of our approach which provably converges to the correct solution using finite memory, by averaging independent realizations of the estimates with progressively reduced kernel bandwidths. We derive the optimal convergence rates accounting for space and time kernels, and demonstrate our method against previous consistent transient rendering methods for participating media.
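The progressive estimation idea above (average independent estimates while shrinking the kernel bandwidth, so bias and variance both vanish in the limit) transplants to a one-dimensional toy: progressive kernel density estimation with the classic bandwidth-reduction schedule r_{i+1}^2 = r_i^2 (i + alpha)/(i + 1) from progressive photon mapping. Everything here is an illustrative stand-in for the transient photon-beam estimator, not the authors' renderer:

```python
import numpy as np

rng = np.random.default_rng(1)
x_query, alpha, r = 0.0, 0.7, 0.5    # query point, schedule, initial radius
running, n_iters, n_samples = 0.0, 2000, 64

for i in range(1, n_iters + 1):
    samples = rng.normal(0.0, 1.0, n_samples)     # one independent pass
    # box-kernel density estimate at x_query with current bandwidth r
    est = (np.abs(samples - x_query) < r).sum() / (n_samples * 2 * r)
    running += (est - running) / i                # average of estimates
    r *= np.sqrt((i + alpha) / (i + 1))           # shrink bandwidth

print(running, 1 / np.sqrt(2 * np.pi))   # converges to N(0,1) density at 0
```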
Item Single-image Full-body Human Relighting (The Eurographics Association, 2021) Lagunas, Manuel; Sun, Xin; Yang, Jimei; Villegas, Ruben; Zhang, Jianming; Shu, Zhixin; Masia, Belen; Gutierrez, Diego; Bousseau, Adrien and McGuire, Morgan
We present a single-image data-driven method to automatically relight images with full-body humans in them. Our framework is based on a realistic scene decomposition leveraging precomputed radiance transfer (PRT) and spherical harmonics (SH) lighting. In contrast to previous work, we lift the assumption of Lambertian materials and explicitly model diffuse and specular reflectance in our data. Moreover, we introduce an additional light-dependent residual term that accounts for errors in the PRT-based image reconstruction. We propose a new deep learning architecture, tailored to the decomposition performed in PRT, that is trained using a combination of L1, logarithmic, and rendering losses. Our model outperforms the state of the art for full-body human relighting on both synthetic images and photographs.

Item Structure-preserving Style Transfer (The Eurographics Association, 2019) Calvo, Santiago; Serrano, Ana; Gutierrez, Diego; Masia, Belen; Casas, Dan and Jarabo, Adrián
Transferring different artistic styles to images while preserving their content is a difficult image processing task. Since the seminal deep learning approach of Gatys et al. [GEB16], many recent works have proposed different approaches for performing this task. However, most of them share one major limitation: a trade-off between how much the target style is transferred, and how much of the content of the original source image is preserved [GEB16, GEB*17, HB17, LPSB17]. In this work, we present a structure-preserving approach for style transfer that builds on top of the approach proposed by Gatys et al. Our approach preserves regions of fine detail by lowering the intensity of the style transfer in those regions, while still conveying the desired style in the overall appearance of the image. We propose a quad-tree image subdivision, and then apply the style transfer operation with different intensity at different subdivision levels. Effectively, this leads to a more intense style transfer in large flat regions, while the content is better preserved in areas with fine structure and detail. Our approach can be easily applied to different style transfer approaches as a post-processing step.
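The quad-tree subdivision driving the structure-preserving transfer above is easy to sketch: split where local detail is high, so flat regions end up in large cells (strong style transfer) and detailed regions in small cells (weaker transfer). The variance criterion, threshold, and per-depth strength rule below are illustrative assumptions, not the paper's exact procedure:

```python
import numpy as np

def quadtree(img, x, y, w, h, thresh=0.01, depth=0, max_depth=6, out=None):
    """Collect leaf cells (x, y, w, h, depth) of a detail-driven quad-tree."""
    if out is None:
        out = []
    block = img[y:y + h, x:x + w]
    if depth == max_depth or w <= 2 or h <= 2 or block.var() < thresh:
        out.append((x, y, w, h, depth))               # homogeneous leaf
        return out
    hw, hh = w // 2, h // 2
    for dx, dy in ((0, 0), (hw, 0), (0, hh), (hw, hh)):
        quadtree(img, x + dx, y + dy, hw, hh, thresh, depth + 1, max_depth, out)
    return out

gray = np.random.rand(128, 128)                       # toy grayscale image
leaves = quadtree(gray, 0, 0, 128, 128)
# One plausible per-cell rule: style strength = 1.0 / (1 + depth), so
# deep (detailed) cells get a gentler transfer than large flat cells.
```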