Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to content
View melttt's full-sized avatar
🌴
On vacation
🌴
On vacation

Block or report melttt

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
C++ 1 Updated Feb 8, 2024

剖析 STL 是一种享受的过程!

C++ 2,400 756 Updated Jan 15, 2018

看<Parsing Techniques -- A practical Guide>,顺手撸的一点代码,纯练习用.

C++ 3 1 Updated Apr 5, 2017

Chinese translation of Bjarne Stroustrup's HOPL4 paper

2,197 398 Updated Oct 14, 2024

This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-clustering.

Python 70 17 Updated Oct 18, 2022

HMM Tutorial

Jupyter Notebook 12 8 Updated Apr 15, 2018

VB Diarization with Eigenvoice and HMM Priors, refactored

Python 14 3 Updated Jul 27, 2021

Variational Bayes HMM over x-vectors diarization

Python 251 57 Updated Jan 15, 2024

Audio-Visual Active Speaker Detection with PyTorch on AVA-ActiveSpeaker dataset

Python 57 7 Updated Jan 18, 2022

In defence of metric learning for speaker recognition

Python 1,041 273 Updated Mar 26, 2024

Diarization scoring tools.

Python 216 41 Updated Mar 28, 2023

Simple, online, and realtime tracking of multiple objects in a video sequence.

Python 3,939 1,092 Updated Nov 28, 2023

AI Face comparison using FaceNet

Python 78 23 Updated Mar 25, 2023

Sequence modeling benchmarks and temporal convolutional networks

Python 4,154 875 Updated Mar 28, 2022

ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks

Python 388 100 Updated May 18, 2023

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 6,163 764 Updated Oct 18, 2024

Pytorch code for "Rethinking CNN Models for Audio Classification"

Python 122 30 Updated Mar 25, 2021

CNN based audio classifier by pytorch (LeNet / VGG / ResNet)

Jupyter Notebook 7 1 Updated Dec 6, 2019

You can find the speech algorithms you want here

C 749 245 Updated Oct 21, 2024

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1,594 225 Updated Oct 16, 2024

Multimodal speaker diarization using pre-trained audio-visual synchronization model

Python 9 6 Updated May 12, 2020

Tool for reading and writing datasets of tensors in a Lightning Memory-Mapped Database (LMDB). Designed to manage machine learning datasets with fast reading speeds.

Python 117 17 Updated Sep 18, 2020

Tiny-ImageNet Classifier using Pytorch

Jupyter Notebook 82 28 Updated Nov 1, 2018

an Audio-Visual Voice Activity Detection using Deep Learning

Python 48 11 Updated Apr 7, 2019

A PyTorch implementation of End-to-End Neural Diarization

Python 98 15 Updated Jun 19, 2023

Extracts Transcript and Summary (Abstractive and Extractive) from the AMI Meeting Corpus

Python 52 30 Updated Dec 4, 2019

A curated list of Multimodal Related Research.

Python 1,306 150 Updated Aug 5, 2023

implement of LSTM+CRF with pytorch

Python 44 8 Updated Jul 9, 2020

[CVPR'19] [PyTorch] Gated Spatio Temporal Energy Graph

Python 151 25 Updated Feb 20, 2020

Memory Enhanced Global-Local Aggregation for Video Object Detection, CVPR2020

Python 566 115 Updated May 13, 2021
Next