💮
love life. live life.
Speech, Alignments, Robust LMs
-
Sr. Staff Member, Apple
-
10:33
(UTC -07:00) - huckiyang.github.io/
- @huckiyang
- channel/UCSj3hCBIds5BpyO7A4F3l7A
Highlights
- Pro
Pinned Loading
-
NVIDIA-NeMo/Speech
NVIDIA-NeMo/Speech PublicA scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
-
NVlabs/OmniVinci
NVlabs/OmniVinci PublicOmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.
-
Voice2Series-Reprogramming
Voice2Series-Reprogramming PublicICML 21 - Voice2Series: Adversarial Reprogramming Acoustic Models for Time Series Classification
-
-
Srijith-rkr/Whispering-LLaMA
Srijith-rkr/Whispering-LLaMA PublicEMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction
-
YUCHEN005/GenTranslate
YUCHEN005/GenTranslate PublicCode for paper "GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators"
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.



