Light

devtae / mic-streaming-system-for-recognition Goto Github PK

View Code? Open in Web Editor NEW

1.0 1.0 0.0 288 KB

음성인식 Real-Time 추론 시스템 구성 및 Response Time 실험

License: MIT License

Python 100.00%

mic-streaming-system-for-recognition's Introduction

Mic-Streaming-System-for-Recognition

위 프로젝트는 음성인식 Real-Time 추론 시스템 구성을 위한 Queue 시스템임.
아래에 Response Time 에 대한 실험을 진행하였음.
코드 구성
- main.py
  - 메인 함수
- model_speech_recognition/inference.py
  - 음성인식 추론 함수

Response Time 1초 보장 에 대한 조건을 만족하기 위하여, 우선적으로 모델에 대한 추론 시간 측정 을 진행하였음.
실험 환경 세팅
- CPU : 12th Gen Intel(R) Core(TM) i5-12400F
- 외장 HDD : Seagate ST5000LM000 2AN170 5TB
- 음성 데이터
  - 1 ~ 10 초 단일 음성 녹음 파일에 대한 추론 시간 측정
- 딥러닝 모델 구조
  - CNN Layer x 2
  - GRU Layer x 3
  - hidden_dim = 512
  - use_bidirectional = True

실험 결과
- 음성 파일 길이 와 추론 시간 사이의 유의미한 관계가 있음을 발견하였음.
- 음성 파일의 길이가 늘어날수록 요구되는 추론 시간이 길어졌음.
- Response Time 을 줄이기 위하여 음성 파일의 길이를 줄이는 방법을 택할 수 있음.

추가 실험 결과
- 추론 시간은 파일 입출력 + 딥러닝 계산 시간 으로 구성되어 있음. 추가적으로 확인해본 바로는, 딥러닝 계산 시간(model.recognize) 보다는 파일 입출력(parse_audio) 에 있어서 대부분의 시간(90%가 넘는)이 소요된다는 것을 알 수 있음.
- HDD 및 SSD 등 디스크의 환경에 따라 실험 결과가 많이 달라질 것임을 알 수 있음.
- 현재는 외장하드로 실험한 결과이므로, 내장 HDD 혹은 SSD 로 작업할 땐 훨씬 적은 Response Time 이 소요됨을 볼 수 있음.
- 실제로, 내장 HDD 환경인 HGST HUH721212AL 에서 실험해본 결과, 0.1 ~ 0.2 초로 이전 대비 훨씬 적은 시간 소요가 발생하였음.
- 따라서, Response Time 1초 보장 을 위하여 1. 빠른 입출력 디스크 이용, 2. 음성 파일 길이 줄이기 방식을 제안한다.

mic-streaming-system-for-recognition's People

Contributors

Stargazers

Watchers

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.