Yuyan Li

Seattle, Washington, United States

241 followers 223 connections

View mutual connections with Yuyan

Welcome back

Email or phone

Password

Forgot password?

or

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

New to LinkedIn? Join now

or

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

New to LinkedIn? Join now

Join to view profile

Apple

University of Missouri-Columbia

Activity

 My team at Apple is looking for a Research Engineer that (1) Has experience with large AI models and systems (2) Can translate research papers and…

 My team at Apple is looking for a Research Engineer that (1) Has experience with large AI models and systems (2) Can translate research papers and…

Liked by Yuyan Li
360 Videos to Gaussian Splattings (Volumography) I developed an optimized workflow for volumography, specifically for converting 360-degree videos…

360 Videos to Gaussian Splattings (Volumography) I developed an optimized workflow for volumography, specifically for converting 360-degree videos…

Liked by Yuyan Li
🧨 diffusers 🤝 bitsandbytes ⚡️ We're shipping native quantization support in diffusers, starting with bitsandbytes 🤗 What's supported? 🧿 1…

🧨 diffusers 🤝 bitsandbytes ⚡️ We're shipping native quantization support in diffusers, starting with bitsandbytes 🤗 What's supported? 🧿 1…

Liked by Yuyan Li

Join now to see all activity

Experience

Apple

Seattle, Washington, United States
-

Sunnyvale, California, United States
-

Sunnyvale, California, United States
-

Sunnyvale, California, United States

Education

University of Missouri-Columbia

2014 - 2021
2009 - 2013

Publications

OmniFusion: 360 Monocular Depth Estimation via Geometry-Aware Fusion

CVPR 2022 March 2, 2022
Other authors
Fast Point Voxel Convolution Neural Network with Selective Feature Fusion for Point Cloud Analysis

International Symposium on Visual Computing 2021, oral
Other authors
See publication
Multi-scale Network with Attentional Multi-resolution Fusion for Point Cloud Semantic Segmentation

ICPR 2022 (submitted, under review)
Other authors
PanoDepth: A Two Stage Approach for Monocular Omnidirectional Depth Estimation

International Conference on 3D Vision (3DV), 2021
Other authors
See publication
SPNet:Multi-Shell Kernel Convolution for Point Cloud Semantic Segmentation

International Symposium on Visual Computing 2021, oral
Other authors
See publication

Patents

Multi-View consistency regularization for semantic interpretation of Equirectangular panoramas

Filed October 20, 2021 US patent app. 17/545,673

We invented a dense regression framework that estimates 360-degree (omni-directional) depth or
segmentation maps from an Equi-Rectangular Projection (ERP) image. Our method follows a general
encoder-decoder pipeline, which involves both convolutional layers and global attention layers. Our
contributions are three-folds. First, we invented a distortion-free convolutional module designed to handle
the varying distortion in 360 image across different regions. Second, we developed the…

We invented a dense regression framework that estimates 360-degree (omni-directional) depth or
segmentation maps from an Equi-Rectangular Projection (ERP) image. Our method follows a general
encoder-decoder pipeline, which involves both convolutional layers and global attention layers. Our
contributions are three-folds. First, we invented a distortion-free convolutional module designed to handle
the varying distortion in 360 image across different regions. Second, we developed the self-attention
the module which uses distortion-free image embedding to compute the appearance attention and use

spherical distance to compute the positional attention. Third, we are the first to use transformer and self-
attention architecture to solve 360 dense regression.
Method for omnidirectional dense regression for machine perception tasks via distortion-free CNN and spherical self-attention

Filed October 20, 2020 US patent app. 16/836,290

This invention introduces a novel regularization term to improve the performance of a deep neural network for semantic interpretation of equal-rectangular panorama images. Our approach utilizes the consistencies between different views of panorama images to reduce the needs of large amount of labelled ground truth data during training. Our innovation can be applied to various business areas such as building construction & maintenance, augmented & virtual reality businesses to reduce the costs.

Projects

Omnidirectional RGB-D Image Representation and Scene Understanding

2019 - 2021

Explored 360 omnidirectional geometry and representation. Developed deep learning-based solutions to address the problems of indoor layout estimation, monocular depth estimation, and stereo matching under 360 image domain. The proposed framework for monocular depth estimation outperformed the current state-of-the-arts by a margin.

Adopted and customized vision transformer on 360 image representation, achieved top performance on scene understanding tasks such as depth prediction and…

Explored 360 omnidirectional geometry and representation. Developed deep learning-based solutions to address the problems of indoor layout estimation, monocular depth estimation, and stereo matching under 360 image domain. The proposed framework for monocular depth estimation outperformed the current state-of-the-arts by a margin.

Adopted and customized vision transformer on 360 image representation, achieved top performance on scene understanding tasks such as depth prediction and semantic segmentation.
3D Point Cloud Feature Learning, Reconstruction, and Semantic Segmentation

2017 - 2021

Performed point cloud feature learning and representation on both large-scale indoor data and outdoor Lidar benchmarks. Designed and optimized deep learning and computer vision algorithms with a focus on efficient and effective performance for point cloud semantic segmentation and reconstruction tasks.

Developed novel algorithms and designed CNN architectures based on both voxel and point convolution for the task of point cloud semantic segmentation. Achieved top-ranking performances in…

Performed point cloud feature learning and representation on both large-scale indoor data and outdoor Lidar benchmarks. Designed and optimized deep learning and computer vision algorithms with a focus on efficient and effective performance for point cloud semantic segmentation and reconstruction tasks.

Developed novel algorithms and designed CNN architectures based on both voxel and point convolution for the task of point cloud semantic segmentation. Achieved top-ranking performances in several challenging datasets, such as S3DIS, ScanNet, and SemanticKitti, etc.
Biomedical Image Synthesis

2017 - 2017

Used CNN, GAN architectures to synthesize unseen, high-quality, CT images for recovering full tomographic information from CT scans. The well-designed architecture performs optical flow estimation and images interpolation/extrapolation and receives state-of-the-art accuracy.
Multi-view RGB-D Image Registration and 3D Model Reconstruction

2016 - 2016

Implemented structure from motion, which is based on traditional SIFT feature matching, and bundle adjustment techniques to find correspondences between indoor multi-view image captured by Kinect v2 and reconstruct 3D scene represented as point cloud.
3D Textured Mesh Model Reconstruction

2015 - 2015

Developed algorithms to convert 3D point clouds of buildings into simplified, high-quality mesh models with real-world image textures. Generated consistent building model texture by performing image stitching

Languages

Chinese

Native or bilingual proficiency
English

Full professional proficiency

More activity by Yuyan

Apple at it again - It's wild that you can just create SoTA depth maps on a consumer GPU in a fraction of a second! 🤯 ML Depth Pro open sourced…

Apple at it again - It's wild that you can just create SoTA depth maps on a consumer GPU in a fraction of a second! 🤯 ML Depth Pro open sourced…

Liked by Yuyan Li
After many requests over the past couple of years, we’ve worked on expanding the license terms for the Arkitscene dataset. It’s now available for…

After many requests over the past couple of years, we’ve worked on expanding the license terms for the Arkitscene dataset. It’s now available for…

Liked by Yuyan Li
We finally know what happened at OpenAI last year #ai #cg #vfx

We finally know what happened at OpenAI last year #ai #cg #vfx

Liked by Yuyan Li
✨ Introducing Med-Gemini, our new family of AI research models specialized for medicine! ✨ Med-Gemini models are tuned from Gemini, building on its…

✨ Introducing Med-Gemini, our new family of AI research models specialized for medicine! ✨ Med-Gemini models are tuned from Gemini, building on its…

Liked by Yuyan Li
Exciting news! Later this quarter, Kokai will open up its beta version, including the ability to spend the SP500+. While Adweek beat us to announcing…

Exciting news! Later this quarter, Kokai will open up its beta version, including the ability to spend the SP500+. While Adweek beat us to announcing…

Liked by Yuyan Li
Introducing Sora, our text-to-video model. Sora can create videos of up to 60 seconds featuring highly detailed scenes, complex camera motion, and…

Introducing Sora, our text-to-video model. Sora can create videos of up to 60 seconds featuring highly detailed scenes, complex camera motion, and…

Liked by Yuyan Li
The Apple Vision Pro's spatial understanding is incredible. It uses AI to process input from the LiDAR Scanner and multiple cameras, performing…

The Apple Vision Pro's spatial understanding is incredible. It uses AI to process input from the LiDAR Scanner and multiple cameras, performing…

Liked by Yuyan Li
Big News Day today! 1/2 We introduce principles for deploying efficient attention-based vision transformers to the Apple Neural Engine (ANE)…

Big News Day today! 1/2 We introduce principles for deploying efficient attention-based vision transformers to the Apple Neural Engine (ANE)…

Liked by Yuyan Li
My first project after joining Apple. We introduce principles for deploying efficient attention-based vision transformers to the Apple Neural Engine…

My first project after joining Apple. We introduce principles for deploying efficient attention-based vision transformers to the Apple Neural Engine…

Liked by Yuyan Li
Get Ready 做好准备 준비하기 Preparati Sois prêt Bereit machen तैयार हो जाओ Prepararse തയ്യാറാകൂ Bersedia Приготовься Připravit se Pasiruošk Готуйся تیار ہو…

Get Ready 做好准备 준비하기 Preparati Sois prêt Bereit machen तैयार हो जाओ Prepararse തയ്യാറാകൂ Bersedia Приготовься Připravit se Pasiruošk Готуйся تیار ہو…

Liked by Yuyan Li
Check out StableDreamer and how we reduce the multi-face Janus problem with a simple approach.

Check out StableDreamer and how we reduce the multi-face Janus problem with a simple approach.

Liked by Yuyan Li
arXiv -> alphaXiv: Students from Stanford have created alphaXiv, a forum where you can post comments and questions directly on top of arXiv papers…

arXiv -> alphaXiv: Students from Stanford have created alphaXiv, a forum where you can post comments and questions directly on top of arXiv papers…

Liked by Yuyan Li

View Yuyan’s full profile

See who you know in common
Get introduced
Contact Yuyan directly

Join to view full profile

Other similar profiles

Explore more posts

Explore collaborative articles

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Explore More

Others named Yuyan Li in United States

10 others named Yuyan Li in United States are on LinkedIn

See others named Yuyan Li

Add new skills with these courses

See all courses

Yuyan Li

Seattle, Washington, United States 241 followers 223 connections

Activity

 My team at Apple is looking for a Research Engineer that (1) Has experience with large AI models and systems (2) Can translate research papers and…

Liked by Yuyan Li

360 Videos to Gaussian Splattings (Volumography) I developed an optimized workflow for volumography, specifically for converting 360-degree videos…

Liked by Yuyan Li

🧨 diffusers 🤝 bitsandbytes ⚡️ We're shipping native quantization support in diffusers, starting with bitsandbytes 🤗 What's supported? 🧿 1…

Liked by Yuyan Li

Experience

Apple

-

-

-

Education

University of Missouri-Columbia

Publications

OmniFusion: 360 Monocular Depth Estimation via Geometry-Aware Fusion

CVPR 2022 March 2, 2022

Fast Point Voxel Convolution Neural Network with Selective Feature Fusion for Point Cloud Analysis

International Symposium on Visual Computing 2021, oral

Multi-scale Network with Attentional Multi-resolution Fusion for Point Cloud Semantic Segmentation

ICPR 2022 (submitted, under review)

PanoDepth: A Two Stage Approach for Monocular Omnidirectional Depth Estimation

International Conference on 3D Vision (3DV), 2021

SPNet:Multi-Shell Kernel Convolution for Point Cloud Semantic Segmentation

International Symposium on Visual Computing 2021, oral

Patents

Multi-View consistency regularization for semantic interpretation of Equirectangular panoramas

Filed October 20, 2021 US patent app. 17/545,673

Method for omnidirectional dense regression for machine perception tasks via distortion-free CNN and spherical self-attention

Filed October 20, 2020 US patent app. 16/836,290

Projects

Omnidirectional RGB-D Image Representation and Scene Understanding

2019 - 2021

3D Point Cloud Feature Learning, Reconstruction, and Semantic Segmentation

2017 - 2021

Biomedical Image Synthesis

2017 - 2017

Multi-view RGB-D Image Registration and 3D Model Reconstruction

2016 - 2016

3D Textured Mesh Model Reconstruction

2015 - 2015

Languages

Chinese

Native or bilingual proficiency

English

Full professional proficiency

More activity by Yuyan

Apple at it again - It's wild that you can just create SoTA depth maps on a consumer GPU in a fraction of a second! 🤯 ML Depth Pro open sourced…

Liked by Yuyan Li

After many requests over the past couple of years, we’ve worked on expanding the license terms for the Arkitscene dataset. It’s now available for…

Liked by Yuyan Li

We finally know what happened at OpenAI last year #ai #cg #vfx

Liked by Yuyan Li

✨ Introducing Med-Gemini, our new family of AI research models specialized for medicine! ✨ Med-Gemini models are tuned from Gemini, building on its…

Liked by Yuyan Li

Exciting news! Later this quarter, Kokai will open up its beta version, including the ability to spend the SP500+. While Adweek beat us to announcing…

Liked by Yuyan Li

Introducing Sora, our text-to-video model. Sora can create videos of up to 60 seconds featuring highly detailed scenes, complex camera motion, and…

Liked by Yuyan Li

The Apple Vision Pro's spatial understanding is incredible. It uses AI to process input from the LiDAR Scanner and multiple cameras, performing…

Liked by Yuyan Li

Big News Day today! 1/2 We introduce principles for deploying efficient attention-based vision transformers to the Apple Neural Engine (ANE)…

Liked by Yuyan Li

My first project after joining Apple. We introduce principles for deploying efficient attention-based vision transformers to the Apple Neural Engine…

Liked by Yuyan Li

Get Ready 做好准备 준비하기 Preparati Sois prêt Bereit machen तैयार हो जाओ Prepararse തയ്യാറാകൂ Bersedia Приготовься Připravit se Pasiruošk Готуйся تیار ہو…

Liked by Yuyan Li

Check out StableDreamer and how we reduce the multi-face Janus problem with a simple approach.

Liked by Yuyan Li

arXiv -> alphaXiv: Students from Stanford have created alphaXiv, a forum where you can post comments and questions directly on top of arXiv papers…

Liked by Yuyan Li

View Yuyan’s full profile

Other similar profiles

Xiao Wang

Sourabh Hanamsheth

Srikar Y.

Lichen Wang

Giorgio Luigi Morales Luna

Seattle, Washington, United States

241 followers 223 connections