Giter Site home page Giter Site logo

zmoth / sherpa-onnx Goto Github PK

View Code? Open in Web Editor NEW

This project forked from k2-fsa/sherpa-onnx

0.0 0.0 0.0 3.82 MB

Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift

Home Page: https://k2-fsa.github.io/sherpa/onnx/index.html

License: Apache License 2.0

Shell 5.25% JavaScript 3.99% C++ 43.92% Python 16.54% C 3.46% Objective-C 0.01% Java 4.10% Go 2.34% C# 3.05% Kotlin 7.30% Swift 3.43% Makefile 0.17% HTML 0.44% CMake 6.00%

sherpa-onnx's Introduction

Introduction

This repository supports running the following functions locally

  • Speech-to-text (i.e., ASR); both streaming and non-streaming are supported
  • Text-to-speech (i.e., TTS)
  • Speaker identification
  • Speaker verification
  • Spoken language identification
  • Audio tagging
  • VAD (e.g., silero-vad)
  • Keyword spotting

on the following platforms and operating systems:

with the following APIs

  • C++, C, Python, Go, C#
  • Java, Kotlin, JavaScript
  • Swift

Links for pre-built Android APKs

Description URL **用户
Streaming speech recognition Address 点此
Text-to-speech Address 点此
Voice activity detection (VAD) Address 点此
VAD + non-streaming speech recognition Address 点此
Two-pass speech recognition Address 点此
Audio tagging Address 点此
Audio tagging (WearOS) Address 点此
Speaker identification Address 点此
Spoken language identification Address 点此
Keyword spotting Address 点此

Links for pre-trained models

Description URL
Speech recognition (speech to text, ASR) Address
Text-to-speech (TTS) Address
VAD Address
Keyword spotting Address
Audio tagging Address
Speaker identification (Speaker ID) Address
Spoken language identification (Language ID) See multi-lingual Whisper ASR models from Speech recognition
Punctuation Address

Useful links

How to reach us

Please see https://k2-fsa.github.io/sherpa/social-groups.html for 新一代 Kaldi 微信交流群 and QQ 交流群.

sherpa-onnx's People

Contributors

csukuangfj avatar pkufool avatar emreozkose avatar zhaomingwork avatar jingzhaoou avatar pingfengluo avatar yujinqiu avatar chiiyeh avatar manyeyes avatar karelvesely84 avatar frankyoujian avatar hiedean avatar w11wo avatar keanucui avatar longshiming avatar manickavela29 avatar mablue avatar kamirdin avatar erquren avatar jinzr avatar bubao avatar 20246688 avatar aask1357 avatar vsd-vector avatar bhaswa avatar daniel-dona avatar garylaurenceauava avatar neuxys avatar li563042811 avatar kajimacn avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.