GPU-optimized AI, Machine Learning, & HPC Software | NVIDIA NGC

Location via proxy:
[Report a bug] [Manage cookies] No cookies No scripts No ads No referrer Show this form

NGC Catalog

Welcome Guest

All You Need to Build AI. All in One Place.Welcome to the NGC Catalog - GPU Accelerated AI models and SDKs that help you infuse AI into your applications at speed of light

NVIDIA NIM

ProteinMPNNContainer

Predicts amino acid sequences from 3D structure of proteins.

Llama-3-Taiwan-70B-InstructContainer

NVIDIA NIM for GPU accelerated Llama-3-Taiwan-70B-Instruct inference through OpenAI compatible APIs

Phind-CodeLlama-34B-v2-InstructContainer

Phind-CodeLlama-34B-v2 is a large language AI model based on CodeLlama, capable of generating code and proficient in Python, C/C++, TypeScript, Java, and more.

nemotron-4-340b-instructContainer

NVIDIA NIM for GPU accelerated Nemotron-4-340B-Instruct inference through OpenAI compatible APIs

Getting Started

Language ModellingCollection - Natural Language Processing

A collection of easy to use, highly optimized Deep Learning Models for Language Modelling. Deep Learning Examples provides Data Scientist and Software Engineers with recipes to Train, fine-tune, and deploy State-of-the-Art Models

NeMo - Automatic Speech RecognitionCollection - Automatic Speech Recognition

This collection contains NeMo models for Automatic Speech Recognition (ASR): Speech to Text, Speech Classification, Speaker Diarization, Speaker Verification, Speaker Recognition, Command Recognition, Voice Activity Detection

DeepStream - CV DeploymentCollection - Intelligent Video Analytics

DeepStream SDK delivers a complete streaming analytics toolkit for AI based video and image understanding and multi-sensor processing. The DeepStream SDK brings deep neural networks and other complex processing tasks into a stream processing pipeline.

LLMs optimized for RTX PCsCollection - Windows Rtx Accelerated Models

A collection of TensorRT-LLM accelerated Windows RTX PC LLM models.

Command Line Interface

Want to get more from NGC? Everything you see here can be used and managed via our powerful CLI tools. Download Now

Documentation

We've got a whole host of documentation, covering the NGC UI and our powerful CLI. You can find out more here. Go to Documentation

AI Enterprise Documentation

Learn how to virtualize any application with NVIDIA virtual GPU technology. Go to Documentation

Enterprise Support

Get to access to knowledgebase articles and support cases. File a Ticket

Licensing Portal

Access the software & licensing portal for your products. Get Your Licenses

NGC Private Registry

Private Registries from NGC allow you to secure, manage, and deploy your own assets to accelerate your journey to AI. Learn More

Getting Started with NVIDIA AI Enterprise

cuOptCollection - High Performance Computing

NVIDIA cuOpt is a world record GPU-accelerated optimization AI microservice that empowers instant dynamic decision-making to solve routing problems with the best-known accuracy at scale.

Meta/Llama3-8b-instructContainer

NVIDIA NIM for GPU accelerated Llama 3 8B inference through OpenAI compatible APIs

Production Branch - May 2024 (PB 24h1)Collection - Deep Learning

Access the production-ready branches of AI frameworks and SDKs. Supported for 9 months with monthly security patches.

NVIDIA AI Enterprise Infra 5Collection - Infrastructure

Access Infrastructure and workload management software, exclusively available with your NVIDIA AI Enterprise subscription.

Popular Collections

Code LlamaAdvanced

Code Llama is an LLM capable of generating code, and natural language about code, from both code and natural language prompts.

Llama 2Advanced

Llama 2 is a large language AI model capable of generating text and code in response to prompts.

Build an AI Chatbot with RAGMachine Learning

Use a reference application to build a fully functional retrieval-augmented generation (RAG)-based AI chatbot built with NVIDIA NIMTM microservices

Automatic Speech RecognitionAutomatic Speech Recognition

A collection of easy to use, highly optimized Deep Learning Models for Recommender Systems. Deep Learning Examples provides Data Scientist and Software Engineers with recipes to Train, fine-tune, and deploy State-of-the-Art Models

NVIDIA HoloscanHealthcare

The AI sensor processing platform

Clara DiscoveryHealthcare

Clara Discovery is a collection of frameworks, applications, and AI models enabling GPU-accelerated computational drug discovery

Clara NLPHealthcare

Clara NLP is a collection of SOTA biomedical pre-trained language models as well as highly optimized pipelines for training NLP models on biomedical and clinical text

Clara ParabricksHealthcare

Clara Parabricks is a collection of software tools and notebooks for next generation sequencing, including short- and long-read applications. These tools are designed to be scalable, generating highly accurate results in an accelerated compute environmen

Popular Containers

Python Basic for AI Workbench

Python Basic - AI Workbench Default Container (Beta)

Python with CUDA 12.0 - AI Workbench Default Container (Beta)

Python with CUDA 12.2 - AI Workbench Default Container (Beta)

Manage and Monitor GPUs in Cluster Environments.

Popular Models

ChatGLM3-6B Chat Int4

ChatGLM3-6B is the latest open-source model in the ChatGLM series. ChatGLM3-6B introduces the following features (1) More Powerful Base Model (2) More Comprehensive Function Support (3) More Comprehensive Open-source Series.

GPUNet-0 pretrained weights (PyTorch, AMP, ImageNet)

GPUNet-0 ImageNet pretrained weights

Llama2-13b Chat Int4

LlaMa 2 is a large language AI model capable of generating text and code in response to prompts.

Mistral-7B Chat Int4

The Mistral-7B-Instruct-v0.1 Large Language Model (LLM) is a instruct fine-tuned version of the Mistral-7B-v0.1 generative text model using a variety of publicly available conversation datasets.

Popular Resources

Endoscopy out of body Sample App Data

Holoscan Sample App data for Endoscopy out of body detection

Holoscan Cars Video

Video of cars for evaluating detection algorithms for Holoscan SDK.

Colonoscopy Sample App Data

Holoscan Sample App Data for AI Colonoscopy Segmentation of Polyps

Endoscopy Sample App Data

Holoscan Sample App Data for AI-based Endoscopy Tool Tracking

Popular Helm Charts

RAG Application: Multimodal Chatbot

This example showcases multi modal usecase in a RAG pipeline. It can understand any kind of images in PDF or .pptx (like graphs and plots) alongside text and tables.

RAG Application: Multiturn Chatbot

This example showcases a RAG workflow with multi-turn conversation capabilities.

RAG Application: Structured Data Chatbot

Sample RAG application which can handle question-answering from tabular data stored in CSV format.

RAG Application: Langchain Text QA Chatbot

A helm chart demonstrating a basic RAG pipeline built using langchain leveraging Nvidia NIM LLM's and Retrievers deployed on-prem.