- Unstructured
- State College, PA
- https://thinkregressively.netlify.app/
Stars
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Python package for tackling multi-class imbalance problems. http://www.cs.put.poznan.pl/mlango/publications/multiimbalance/
Port of Google's language-detection library to Python.
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
LLM training code for Databricks foundation models
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
Python Library to evaluate VLM models' robustness across diverse benchmarks
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
A Phi-3 book for getting started with Phi-3, a family of open-source AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SL…
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting ~100 VLMs and 40+ benchmarks
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train Dataset for table understanding and develop a generalist tab…
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with comma…
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
DSPy: The framework for programming—not prompting—foundation models
Example of how to use LangChain and Vertex AI Generative AI to ask plain English questions about Pandas dataframes.
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
A Python client for the Unstructured hosted API
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
State-of-the-Art Text Embeddings
InferSent sentence embeddings
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic.
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022