Tianyu Liu

SimpleFSDP: Simpler Fully Sharded Data Parallel with torch.compile

Nov 01, 2024
Via arXiv

Aligning CodeLLMs with Direct Preference Optimization

Oct 24, 2024
Via arXiv

PGDiffSeg: Prior-Guided Denoising Diffusion Model with Parameter-Shared Attention for Breast Cancer Segmentation

Oct 23, 2024
Via arXiv

Efficiently Computing Susceptibility to Context in Language Models

Oct 18, 2024
Via arXiv

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

Oct 10, 2024
Via arXiv

TorchTitan: One-stop PyTorch native solution for production ready LLM pre-training

Oct 09, 2024
Via arXiv

A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation

Oct 02, 2024
Via arXiv

Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement

Sep 18, 2024
Via arXiv

Qwen2.5-Coder Technical Report

Sep 18, 2024
Via arXiv

Towards a Unified View of Preference Learning for Large Language Models: A Survey

Sep 04, 2024
Via arXiv