- Jakarta, Indonesia (UTC +07:00)
- https://sofian.hadiwijaya.co
- @sofianhw
Stars
A template for building web agents with Stagehand on Browserbase
Enable AI models for video production in the browser
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
OpenAI Realtime WebRTC Python client
A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.
The FluxGarage RoboEyes library draws smoothly animated robot eyes on OLED displays, using the Adafruit GFX library.
A course on aligning smol models.
On-device AI across mobile, embedded and edge for PyTorch
TinyML Cookbook, 2E, published by Packt
Talk to any LLM with hands-free voice interaction, voice interruption, and a Live2D talking face, running locally across platforms
Official Implementation of RTGS: Enabling Real-Time Gaussian Splatting on Mobile Devices Using Efficiency-Guided Pruning and Foveated Rendering.
GaussianSpeech: Audio-Driven Gaussian Avatars
Shuffle: A general purpose security automation platform. Our focus is on collaboration and resource sharing.
Compass Apache TVM is an enhanced version of Apache TVM that provides quick support, optimization, and heterogeneous execution for a wide range of neural network (NN) models.
CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments
stackblitz-labs / bolt.diy
Forked from stackblitz/bolt.new
Prompt, run, edit, and deploy full-stack web applications using any LLM you want!
MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code
An Open-source LTE Downlink/Uplink Eavesdropper
Render Gaussian Splats using Metal on Apple platforms (iOS/iPhone/iPad, macOS, and visionOS)
Empowering everyone to host fast and efficient Minecraft servers.
Omni SenseVoice: High-Speed Speech Recognition with word timestamps 🗣️🎯
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
A complete daily plan for studying to become a machine learning engineer.