Google Scholar

User profiles for Viorica Patraucean

Viorica Patraucean

Google DeepMind

Verified email at google.com

Cited by 2213

Understanding real world indoor scenes with synthetic data

A Handa, V Patraucean… - Proceedings of the …, 2016 - openaccess.thecvf.com

Scene understanding is a prerequisite to many high level tasks for any automated intelligent
machine operating in real world environments. Recent attempts with supervised learning …

Cited by 442

Spatio-temporal video autoencoder with differentiable memory

V Patraucean, A Handa, R Cipolla - arXiv preprint arXiv:1511.06309, 2015 - arxiv.org

We describe a new spatio-temporal video autoencoder, based on a classic spatial image
autoencoder and a novel nested temporal autoencoder. The temporal encoder is represented …

Cited by 364

State of research in automatic as-built modelling

V Pătrăucean, I Armeni, M Nahangi, J Yeung… - Advanced Engineering …, 2015 - Elsevier

Building Information Models (BIMs) are becoming the official standard in the construction
industry for encoding, reusing, and exchanging information about structural assets. …

Cited by 393

Active acquisition for multimodal temporal data: A challenging decision-making task

…, C Cangea, E Vértes, A Jaegle, V Patraucean… - arXiv preprint arXiv …, 2022 - arxiv.org

We introduce a challenging decision-making task that we call active acquisition for
multimodal temporal data (A2MT). In many real-world scenarios, input features are not readily …

Cited by 4

Perception test: A diagnostic benchmark for multimodal video models

V Patraucean, L Smaira, A Gupta… - Advances in …, 2024 - proceedings.neurips.cc

We propose a novel multimodal video benchmark, the Perception Test, to evaluate the perception
and reasoning skills of pre-trained multimodal models (e.g. Flamingo, BEiT-3, or GPT-4). …

Cited by 32

Broaden your views for self-supervised video learning

…, M Malinowski, V Pătrăucean… - Proceedings of the …, 2021 - openaccess.thecvf.com

Most successful self-supervised learning methods are trained to align the representations of
two independent views from the data. State-of-the-art methods in video are inspired by …

Cited by 136

A simple recipe for contrastively pre-training video-first encoders beyond 16 frames

…, J Chiu, J Heyward, V Patraucean… - Proceedings of the …, 2024 - openaccess.thecvf.com

Understanding long real-world videos requires modeling of long-range visual dependencies.
To this end we explore video-first architectures building on the common paradigm of …

Cited by 10

A parameterless line segment and elliptical arc detector with enhanced ellipse fitting

V Pătrăucean, P Gurdjos, RG Von Gioi - … 7-13, 2012, Proceedings, Part II …, 2012 - Springer

We propose a combined line segment and elliptical arc detector, which formally guarantees
the control of the number of false positives and requires no parameter tuning. The accuracy …

Cited by 143

gvnn: Neural network library for geometric computer vision

A Handa, M Bloesch, V Pătrăucean, S Stent… - … October 8-10 and 15-16 …, 2016 - Springer

We introduce gvnn, a neural network library in Torch aimed towards bridging the gap between
classic geometric computer vision and deep learning. Inspired by the recent success of …

Cited by 133

SceneNet: An annotated model generator for indoor scene understanding

A Handa, V Pătrăucean, S Stent… - 2016 IEEE International …, 2016 - ieeexplore.ieee.org

We introduce SceneNet, a framework for generating high-quality annotated 3D scenes to
aid indoor scene understanding. SceneNet leverages manually-annotated datasets of real …

Cited by 127
