Skip to content

Conversation

@kaisugi
Copy link
Member

@kaisugi kaisugi commented Dec 25, 2025

Summary

Added Reazon Holdings' speech language models from their collection and reorganized speech model categories by creating a new "Feature Extraction" section.

Changes

New Category Structure

音声認識 (Automatic Speech Recognition)

  • ASR models for end-to-end speech recognition

特徴抽出 (Feature Extraction) - NEW

  • Pre-trained models for feature extraction (HuBERT, wav2vec 2.0, Zipformer)
  • Used as foundation models for downstream tasks

その他 (Others)

  • Speech dialogue and other speech-related models

Added ASR Models

  1. Reazon HuBERT ASR (rs35kh, rs35kh-bpe)

    • Fine-tuned on ReazonSpeech v2.0
    • 2 variants with different tokenization
  2. Reazon Zipformer ASR (rs35kh, rs35kh-bpe)

    • Fine-tuned on ReazonSpeech v2.0
    • 2 variants with different tokenization
  3. Reazon wav2vec 2.0 ASR (base-rs35kh, large-rs35kh)

    • Fine-tuned on ReazonSpeech v2.0
    • Base and large variants

Added Feature Extraction Models

  1. Reazon HuBERT (base-k2)

    • Pre-trained on ReazonSpeech
  2. Reazon Zipformer (base-k2)

    • Pre-trained on ReazonSpeech

Reorganized Existing Models

Moved the following from "Others" to "Feature Extraction":

  • くしなだ (Kushinada) - HuBERT
  • 東大HuBERT (University of Tokyo HuBERT)
  • いざなみ (Izanami) - wav2vec 2.0
  • Reazon wav2vec 2.0 (base, large)

Documentation Updates

  • Updated Japanese README
  • Updated English README
  • Updated French README
  • Maintained consistent structure across all language versions

Closes #573

🤖 Generated with Claude Code

- Added ASR models: Reazon HuBERT ASR, Reazon Zipformer ASR, Reazon wav2vec 2.0 ASR
- Created new "Feature Extraction" category for speech models
- Moved feature extraction models (HuBERT, wav2vec 2.0, Zipformer) to new category
- Added base models: Reazon HuBERT (base-k2), Reazon Zipformer (base-k2)
- Updated across all three language versions (Japanese, English, French)

Closes #573

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <[email protected]>
@kaisugi kaisugi force-pushed the add-reazon-speech-models branch from 4fdfb14 to 60c1c25 Compare December 25, 2025 13:47
@kaisugi kaisugi merged commit 9f67dc6 into main Dec 25, 2025
1 check passed
@kaisugi kaisugi deleted the add-reazon-speech-models branch December 25, 2025 13:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

speech language models by reazon-speech

2 participants