Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–21 of 21 results for author: Ganin, Y

.
  1. arXiv:2211.15089  [pdf, other

    cs.CL cs.LG

    Continuous diffusion for categorical data

    Authors: Sander Dieleman, Laurent Sartran, Arman Roshannai, Nikolay Savinov, Yaroslav Ganin, Pierre H. Richemond, Arnaud Doucet, Robin Strudel, Chris Dyer, Conor Durkan, Curtis Hawthorne, Rémi Leblond, Will Grathwohl, Jonas Adler

    Abstract: Diffusion models have quickly become the go-to paradigm for generative modelling of perceptual signals (such as images and sound) through iterative refinement. Their success hinges on the fact that the underlying physical phenomena are continuous. For inherently discrete and categorical data such as language, various diffusion-inspired alternatives have been proposed. However, the continuous natur… ▽ More

    Submitted 15 December, 2022; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: 26 pages, 8 figures; corrections and additional information about hyperparameters

  2. arXiv:2211.04236  [pdf, other

    cs.CL cs.LG

    Self-conditioned Embedding Diffusion for Text Generation

    Authors: Robin Strudel, Corentin Tallec, Florent Altché, Yilun Du, Yaroslav Ganin, Arthur Mensch, Will Grathwohl, Nikolay Savinov, Sander Dieleman, Laurent Sifre, Rémi Leblond

    Abstract: Can continuous diffusion models bring the same performance breakthrough on natural language they did for image generation? To circumvent the discrete nature of text data, we can simply project tokens in a continuous space of embeddings, as is standard in language modeling. We propose Self-conditioned Embedding Diffusion, a continuous diffusion mechanism that operates on token embeddings and allows… ▽ More

    Submitted 8 November, 2022; originally announced November 2022.

    Comments: 15 pages

  3. arXiv:2209.11142  [pdf, other

    cs.LG cs.AI stat.ML

    A Generalist Neural Algorithmic Learner

    Authors: Borja Ibarz, Vitaly Kurin, George Papamakarios, Kyriacos Nikiforou, Mehdi Bennani, Róbert Csordás, Andrew Dudzik, Matko Bošnjak, Alex Vitvitskyi, Yulia Rubanova, Andreea Deac, Beatrice Bevilacqua, Yaroslav Ganin, Charles Blundell, Petar Veličković

    Abstract: The cornerstone of neural algorithmic reasoning is the ability to solve algorithmic tasks, especially in a way that generalises out of distribution. While recent years have seen a surge in methodological improvements in this area, they mostly focused on building specialist models. Specialist models are capable of learning to neurally execute either only one algorithm or a collection of algorithms… ▽ More

    Submitted 3 December, 2022; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: To appear at LoG 2022 (Spotlight talk). 23 pages, 11 figures

  4. arXiv:2105.02769  [pdf, other

    cs.CV cs.LG

    Computer-Aided Design as Language

    Authors: Yaroslav Ganin, Sergey Bartunov, Yujia Li, Ethan Keller, Stefano Saliceti

    Abstract: Computer-Aided Design (CAD) applications are used in manufacturing to model everything from coffee mugs to sports cars. These programs are complex and require years of training and experience to master. A component of all CAD models particularly difficult to make are the highly structured 2D sketches that lie at the heart of every 3D construction. In this work, we propose a machine learning model… ▽ More

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: 24 pages, 11 figures, 3 tables

  5. arXiv:2002.10880  [pdf, other

    cs.GR cs.CV cs.LG stat.ML

    PolyGen: An Autoregressive Generative Model of 3D Meshes

    Authors: Charlie Nash, Yaroslav Ganin, S. M. Ali Eslami, Peter W. Battaglia

    Abstract: Polygon meshes are an efficient representation of 3D geometry, and are of central importance in computer graphics, robotics and games development. Existing learning-based approaches have avoided the challenges of working with 3D meshes, instead using alternative object representations that are more compatible with neural architectures and training approaches. We present an approach which models th… ▽ More

    Submitted 23 February, 2020; originally announced February 2020.

  6. arXiv:1910.01007  [pdf, other

    cs.CV cs.LG stat.ML

    Unsupervised Doodling and Painting with Improved SPIRAL

    Authors: John F. J. Mellor, Eunbyung Park, Yaroslav Ganin, Igor Babuschkin, Tejas Kulkarni, Dan Rosenbaum, Andy Ballard, Theophane Weber, Oriol Vinyals, S. M. Ali Eslami

    Abstract: We investigate using reinforcement learning agents as generative models of images (extending arXiv:1804.01118). A generative agent controls a simulated painting environment, and is trained with rewards provided by a discriminator network simultaneously trained to assess the realism of the agent's samples, either unconditional or reconstructions. Compared to prior work, we make a number of improvem… ▽ More

    Submitted 2 October, 2019; originally announced October 2019.

    Comments: See https://learning-to-paint.github.io for an interactive version of this paper, with videos

    ACM Class: I.2; I.4

  7. arXiv:1804.01118  [pdf, other

    cs.CV cs.LG stat.ML

    Synthesizing Programs for Images using Reinforced Adversarial Learning

    Authors: Yaroslav Ganin, Tejas Kulkarni, Igor Babuschkin, S. M. Ali Eslami, Oriol Vinyals

    Abstract: Advances in deep generative networks have led to impressive results in recent years. Nevertheless, such models can often waste their capacity on the minutiae of datasets, presumably due to weak inductive biases in their decoders. This is where graphics engines may come in handy since they abstract away low-level details and represent images as high-level programs. Current methods that combine deep… ▽ More

    Submitted 3 April, 2018; originally announced April 2018.

    Comments: 12 pages, 13 figures

  8. arXiv:1712.04120  [pdf, other

    stat.ML cs.LG

    GibbsNet: Iterative Adversarial Inference for Deep Graphical Models

    Authors: Alex Lamb, Devon Hjelm, Yaroslav Ganin, Joseph Paul Cohen, Aaron Courville, Yoshua Bengio

    Abstract: Directed latent variable models that formulate the joint distribution as $p(x,z) = p(z) p(x \mid z)$ have the advantage of fast and exact sampling. However, these models have the weakness of needing to specify $p(z)$, often with a simple fixed prior that limits the expressiveness of the model. Undirected latent variable models discard the requirement that $p(z)$ be specified with a prior, yet samp… ▽ More

    Submitted 11 December, 2017; originally announced December 2017.

    Comments: NIPS 2017

  9. arXiv:1607.07215  [pdf, other

    cs.CV

    DeepWarp: Photorealistic Image Resynthesis for Gaze Manipulation

    Authors: Yaroslav Ganin, Daniil Kononenko, Diana Sungatullina, Victor Lempitsky

    Abstract: In this work, we consider the task of generating highly-realistic images of a given face with a redirected gaze. We treat this problem as a specific instance of conditional image generation and suggest a new deep architecture that can handle this task very well as revealed by numerical comparison with prior art and a user study. Our deep architecture performs coarse-to-fine warping with an additio… ▽ More

    Submitted 26 July, 2016; v1 submitted 25 July, 2016; originally announced July 2016.

    Comments: Fixed typos, 14 + 2 + 2 pages, ECCV 2016

  10. arXiv:1512.05300  [pdf, other

    cs.CV

    Multiregion Bilinear Convolutional Neural Networks for Person Re-Identification

    Authors: Evgeniya Ustinova, Yaroslav Ganin, Victor Lempitsky

    Abstract: In this work we propose a new architecture for person re-identification. As the task of re-identification is inherently associated with embedding learning and non-rigid appearance description, our architecture is based on the deep bilinear convolutional network (Bilinear-CNN) that has been proposed recently for fine-grained classification of highly non-rigid objects. While the last stages of the o… ▽ More

    Submitted 6 September, 2017; v1 submitted 16 December, 2015; originally announced December 2015.

    Comments: in AVSS 2017

  11. arXiv:1505.07818  [pdf, other

    stat.ML cs.LG cs.NE

    Domain-Adversarial Training of Neural Networks

    Authors: Yaroslav Ganin, Evgeniya Ustinova, Hana Ajakan, Pascal Germain, Hugo Larochelle, François Laviolette, Mario Marchand, Victor Lempitsky

    Abstract: We introduce a new representation learning approach for domain adaptation, in which data at training and test time come from similar but different distributions. Our approach is directly inspired by the theory on domain adaptation suggesting that, for effective domain transfer to be achieved, predictions must be made based on features that cannot discriminate between the training (source) and test… ▽ More

    Submitted 26 May, 2016; v1 submitted 28 May, 2015; originally announced May 2015.

    Comments: Published in JMLR: http://jmlr.org/papers/v17/15-239.html

    Journal ref: Journal of Machine Learning Research 2016, vol. 17, p. 1-35

  12. arXiv:1412.6553  [pdf, other

    cs.CV cs.LG

    Speeding-up Convolutional Neural Networks Using Fine-tuned CP-Decomposition

    Authors: Vadim Lebedev, Yaroslav Ganin, Maksim Rakhuba, Ivan Oseledets, Victor Lempitsky

    Abstract: We propose a simple two-step approach for speeding up convolution layers within large convolutional neural networks based on tensor decomposition and discriminative fine-tuning. Given a layer, we use non-linear least squares to compute a low-rank CP-decomposition of the 4D convolution kernel tensor into a sum of a small number of rank-one tensors. At the second step, this decomposition is used to… ▽ More

    Submitted 24 April, 2015; v1 submitted 19 December, 2014; originally announced December 2014.

  13. arXiv:1409.7495  [pdf, other

    stat.ML cs.LG cs.NE

    Unsupervised Domain Adaptation by Backpropagation

    Authors: Yaroslav Ganin, Victor Lempitsky

    Abstract: Top-performing deep architectures are trained on massive amounts of labeled data. In the absence of labeled data for a certain task, domain adaptation often provides an attractive option given that labeled data of similar nature but from a different domain (e.g. synthetic images) are available. Here, we propose a new approach to domain adaptation in deep architectures that can be trained on large… ▽ More

    Submitted 27 February, 2015; v1 submitted 26 September, 2014; originally announced September 2014.

  14. arXiv:1406.6558  [pdf, other

    cs.CV

    $ N^4 $-Fields: Neural Network Nearest Neighbor Fields for Image Transforms

    Authors: Yaroslav Ganin, Victor Lempitsky

    Abstract: We propose a new architecture for difficult image processing operations, such as natural edge detection or thin object segmentation. The architecture is based on a simple combination of convolutional neural networks with the nearest neighbor search. We focus our attention on the situations when the desired image transformation is too hard for a neural network to learn explicitly. We show that in… ▽ More

    Submitted 3 July, 2014; v1 submitted 25 June, 2014; originally announced June 2014.

  15. arXiv:1303.1968  [pdf, ps, other

    cond-mat.str-el cond-mat.mtrl-sci

    Mott localization in the correlated superconductor Cs3C60 resulting from the molecular Jahn-Teller effect

    Authors: Katalin Kamaras, Gyongyi Klupp, Peter Matus, Alexey Y. Ganin, Alec McLennan, Matthew J. Rosseinsky, Yasuhiro Takabayashi, Martin T. McDonald, Kosmas Prassides

    Abstract: Cs3C60 is a correlated superconductor under pressure, but an insulator under ambient conditions. The mechanism causing this insulating behavior is the combination of Mott localization and the dynamic Jahn-Teller effect. We show evidence from infrared spectroscopy for the dynamic Jahn-Teller distortion. The continuous change with temperature of the splitting of infrared lines is typical Jahn-Teller… ▽ More

    Submitted 8 March, 2013; originally announced March 2013.

    Comments: 6 pages, 4 figures, 1 supplementary movie; XXI International Symposium on the Jahn-Teller Effect, Tsukuba, Japan, August 26-31, 2012

    Journal ref: Journal of Physics: Conference Series 428 (2013) 012002

  16. arXiv:1204.5971  [pdf, ps, other

    physics.optics cond-mat.mtrl-sci

    Raman response of Stage-1 graphite intercalation compounds revisited

    Authors: J. C. Chacón-Torres, A. Y. Ganin, M. J. Rosseinsky, T. Pichler

    Abstract: We present a detailed in-situ Raman analysis of stage-1 KC8, CaC6, and LiC6 graphite intercalation compounds (GIC) to unravel their intrinsic finger print. Four main components were found between 1200 cm-1 and 1700 cm-1, and each of them were assigned to a corresponding vibrational mode. From a detailed line shape analysis of the intrinsic Fano-lines of the G- and D-line response we precisely dete… ▽ More

    Submitted 13 July, 2012; v1 submitted 26 April, 2012; originally announced April 2012.

    Comments: 6 pages, 3 figures, 2 tables

    Journal ref: Phys. Rev. B 86, 075406 (2012)

  17. arXiv:1202.0375  [pdf, other

    cond-mat.supr-con cond-mat.str-el

    Anomalous dependence of the c-axis polarized Fe B$_{1g}$ phonon mode with Fe and Se concentrations in Fe$_{1+y}$Te$_{1-x}$Se$_x$

    Authors: Y. J. Um, A. Subedi, P. Toulemonde, A. Y. Ganin, L. Boeri, M. Rahlenbeck, Y. Liu, C. T. Lin, S. J. E. Carlsson, A. Sulpice, M. J. Rosseinsky, B. Keimer, M. Le Tacon

    Abstract: We report an investigation of the lattice dynamical properties in a range of Fe$_{1+y}$Te$_{1-x}$Se$_{x}$ compounds, with special emphasis on the c-axis polarized vibration of Fe with B$_{1g}$ symmetry, a Raman active mode common to all families of Fe-based superconductors. We have carried out a systematic study of the temperature dependence of this phonon mode as a function of Se $x$ and excess F… ▽ More

    Submitted 23 February, 2012; v1 submitted 2 February, 2012; originally announced February 2012.

    Comments: 11 pages, 7 figures, 4 tables, to appear in Phys. Rev. B

    Journal ref: Phys, Rev. B 85, 064519 (2012)

  18. arXiv:1102.0488  [pdf

    cond-mat.supr-con

    Cation vacancy order in the K0.8+xFe1.6-ySe2 system: five-fold cell expansion accommodates 20% tetrahedral vacancies

    Authors: J. Bacsa, A. Y. Ganin, Y. Takabayashi, K. E. Christensen, K. Prassides, M. J. Rosseinsky, J. B. Claridge

    Abstract: Ordering of the tetrahedral site vacancies in two crystals of refined compositions K0.93(1)Fe1.52(1)Se2 and K0.862(3)Fe1.563(4)Se2 produces a fivefold expansion of the parent ThCr2Si2 unit cell in the ab plane which can accommodate 20% vacancies on a single site within the square FeSe layer. The iron charge state is maintained close to +2 by coupling of the level of alkali metal and iron vacancies… ▽ More

    Submitted 18 February, 2011; v1 submitted 2 February, 2011; originally announced February 2011.

    Comments: 5 pages 3 figures accepted for publication in Chemical Science Chem. Sci., DOI:10.1039/C1SC00070E

    Journal ref: Chem. Sci., 2011, 2 (6), 1054 - 1058

  19. arXiv:1007.3914  [pdf, ps, other

    cond-mat.supr-con

    Anisotropic fluctuations and quasiparticle excitations in FeSe_0.5Te_0.5

    Authors: A. Serafin, A. I. Coldea, A. Y. Ganin, M. J. Rosseinsky, K. Prassides, D. Vignolles, A. Carrington

    Abstract: We present data for the temperature dependence of the magnetic penetration depth lambda(T), heat capacity C(T), resistivity R(T) and magnetic torque ?tau for highly homogeneous single crystal samples of Fe1:0Se0:44(4)Te0:56(4). lambda(T) was measured down to 200mK in zero field. We find lambda(T) follows a power law lambda~T^n with n = 2.2 +/- 0.1. This is similar to some 122 iron-arsenides and li… ▽ More

    Submitted 1 October, 2010; v1 submitted 22 July, 2010; originally announced July 2010.

    Comments: 10 pages, 9 figures, submitted to PRB

    Journal ref: Phys. Rev. B 82, 104514 (2010)

  20. arXiv:1006.3411  [pdf, ps, other

    cond-mat.supr-con

    Two-electronic component behavior in the multiband FeSe$_{0.42}$Te$_{0.58}$ superconductor

    Authors: D. Arcon, P. Jeglic, A. Zorko, A. Potocnik, A. Y. Ganin, Y. Takabayashi, M. J. Rosseinsky, K. Prassides

    Abstract: We report X-band EPR and $^{125}$Te and $^{77}$Se NMR measurements on single-crystalline superconducting FeSe$_{0.42}$Te$_{0.58}$ ($T_c$ = 11.5(1) K). The data provide evidence for the coexistence of intrinsic localized and itinerant electronic states. In the normal state, localized moments couple to itinerant electrons in the Fe(Se,Te) layers and affect the local spin susceptibility and spin fluc… ▽ More

    Submitted 17 June, 2010; originally announced June 2010.

    Comments: 5 pages, 4 figures

    Journal ref: Physical Review B 82, 140508(R) (2010)

  21. Strong electron correlations in the normal state of FeSe0.42Te0.58

    Authors: A. Tamai, A. Y. Ganin, E. Rozbicki, J. Bacsa, W. Meevasana, P. D. C. King, M. Caffio, R. Schaub, S. Margadonna, K. Prassides, M. J. Rosseinsky, F. Baumberger

    Abstract: We investigate the normal state of the '11' iron-based superconductor FeSe0.42Te0.58 by angle resolved photoemission. Our data reveal a highly renormalized quasiparticle dispersion characteristic of a strongly correlated metal. We find sheet dependent effective carrier masses between ~ 3 - 16 m_e corresponding to a mass enhancement over band structure values of m*/m_band ~ 6 - 20. This is nearly… ▽ More

    Submitted 10 February, 2010; v1 submitted 16 December, 2009; originally announced December 2009.

    Comments: 5 pages, 4 figures, to appear in Phys. Rev. Lett

    Journal ref: Phys. Rev. Lett. 104, 097002 (2010)