huckiyang

Follow

💮

love life. live life.

huckiyang

💮

love life. live life.

Follow

Speech, Alignments, Robust LMs

129 followers · 89 following

Sr. Staff Member, Apple
10:33 (UTC -07:00)
huckiyang.github.io/
@huckiyang
channel/UCSj3hCBIds5BpyO7A4F3l7A

Achievements

Achievements

Highlights

Pro

Pinned Loading

NVIDIA-NeMo/Speech NVIDIA-NeMo/Speech Public

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 17.6k 3.5k
NVlabs/OmniVinci NVlabs/OmniVinci Public

OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.

Python 672 52
Voice2Series-Reprogramming Voice2Series-Reprogramming Public

ICML 21 - Voice2Series: Adversarial Reprogramming Acoustic Models for Time Series Classification

TypeScript 71 11
jax-nemotron jax-nemotron Public

open hymba nemotron on jax / nnx

Python 3
Srijith-rkr/Whispering-LLaMA Srijith-rkr/Whispering-LLaMA Public

EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction

Jupyter Notebook 271 16
YUCHEN005/GenTranslate YUCHEN005/GenTranslate Public

Code for paper "GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators"

Python 199 9