Giter Site home page Giter Site logo

claritylab / lucida Goto Github PK

View Code? Open in Web Editor NEW
4.8K 4.8K 883.0 1.29 GB

Speech and Vision Based Intelligent Personal Assistant

License: Other

Shell 3.21% Makefile 1.42% C++ 5.07% Python 10.72% Java 73.83% JavaScript 3.18% Thrift 0.11% Batchfile 0.05% CSS 0.35% HTML 1.15% Dockerfile 0.41% Jupyter Notebook 0.50%

lucida's People

Contributors

bitdeli-chef avatar bnsblue avatar cassiosantos avatar dalgaaf avatar dependabot[bot] avatar dmichelin avatar haoyifan avatar hillm3 avatar hoseoglee avatar jaylenw avatar jhauswald avatar kamalgalrani avatar kolewu avatar kshuyu avatar lisqu avatar mkyutani avatar rovinski avatar ryanniehaus avatar saijel avatar seong889 avatar xiaoweiw avatar ypkang avatar yqzhang avatar zhexuanc avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

lucida's Issues

Clarify that sirius is beta and insecure

I played around with sirius and detected that the sirius-web app gives you a python terminal if an error occurres. And I think that the whole project is at this time only for developers and should not be served for the public.

Just my minds ;)

sirius unix command shell

I'm working on a sirius client written in C and libcurl.

Usage: sirius [OPTION]... QUESTION ...

  [QUESTION]   QUESTION to ask sirius.
               If it is not specified, data will be
               taken from standard input.

  -a, --speechrecog [WAV FILE]
               WAV_FILE some voice audio file for sirius to recognice.

  -i, --imagematch [IMAGE_FILE]
               IMAGE_FILE some image file for sirius to match.

  -u, --url [URL]
               URL from sirius host..

  -c, --cert [CERTIFICATE_FILE]
               CERTIFICATE_FILE the certificate for sirius host.

  -v, --version
               Display the version number of sirius.

      --verbose
               Display verbose information to stderr.

  -h, --help   Display this help message.

This would allow to pipe questions to sirius:

echo "What is this?" | sirius -i ./eifel_tower.bmp

#Maybe it should answer voice recognition by default if no other flag is set.
sirius -s ./capital_of_italy.wav | sirius 

Innstalation and first run - pocketsphinx empty string

Firstly, thank you very much for so interesting approach!
When i run pocketsphinx_continuous -infile where.is.las.vegas.wav
I got a lot of information plus ... there is las vegas ... so it seems to work.

But! When I run ./start-asr-server.sh pocketsphinx and parallel
Starting server on localhost:8081, use to stop

$ ./sirius-asr-test.sh ../inputs/questions/where.is.las.vegas.wav
Your audio file is:
../inputs/questions/where.is.las.vegas.wav
Sending request to server...

***********************************************

and the server writes transcript []: (empty)
So, the question - why?

I am using last version 8b71aba for now.

Also I can't compile kaldi:

$ sudo ./compile-sirius-servers.sh
[sudo] password for timopheym:
Sphinx4 server done.
Java Pocketsphinx server done.
libtool: install: warning: relinking `libsphinxad.la'
libtool: install: warning: relinking `_sphinxbase.la'
libtool: install: warning: relinking `libpocketsphinx.la'
libtool: install: warning: relinking `_pocketsphinx.la'
C++ Pocketsphinx server done.
***configure failed: Could not find file /include/fst/fst.h:
    you may not have installed OpenFst.  See ../tools/INSTALL *** ```

So, installation process is not simple... ;) (it took me about 2 hours), but I hope we will go through it..

Installation problem as following http://sirius.clarity-lab.org/sirius-suite/#setup

sudo ./compile-sirius-servers.sh raises following erroneous output

Sphinx4 server done.
Java Pocketsphinx server done.
make: *** No targets specified and no makefile found.  Stop.
make: *** No rule to make target `install'.  Stop.
make: *** No targets specified and no makefile found.  Stop.
make: *** No rule to make target `install'.  Stop.
C++ Pocketsphinx server done.
***configure failed: Could not find file /include/fst/fst.h:
    you may not have installed OpenFst.  See ../tools/INSTALL ***

and if I run ./start-qa-server.sh, it raises also this

Error: Could not find or load main class info.ephyra.OpenEphyraServer

OpenCV installation has the following error as well

/home/retina18/Downloads/sirius/sirius-application/opencv-2.4.9/modules/gpu/src/nvidia/core/NCVPixelOperations.hpp(132): error: a storage class is not allowed in an explicit specialization

/home/retina18/Downloads/sirius/sirius-application/opencv-2.4.9/modules/gpu/src/nvidia/core/NCVPixelOperations.hpp(133): error: a storage class is not allowed in an explicit specialization

31 errors detected in the compilation of "/tmp/tmpxft_0000433a_00000000-21_NCVPyramid.compute_35.cpp1.ii".
CMake Error at cuda_compile_generated_NCVPyramid.cu.o.cmake:264 (message):
  Error generating file
  /home/retina18/Downloads/sirius/sirius-application/opencv-2.4.9/build/modules/gpu/CMakeFiles/cuda_compile.dir/src/nvidia/core/./cuda_compile_generated_NCVPyramid.cu.o


make[2]: *** [modules/gpu/CMakeFiles/cuda_compile.dir/src/nvidia/core/./cuda_compile_generated_NCVPyramid.cu.o] Error 1
make[2]: *** Waiting for unfinished jobs....
make[1]: *** [modules/gpu/CMakeFiles/opencv_gpu.dir/all] Error 2
make: *** [all] Error 2

I guess something is wrong in installation scripts.

I am on Ubuntu 14.04 with GPU support.

GMM input file missing

The gmm benchmarks in sirius-suite refer to a non-existent input file gmm/input/gmm_data.txt

remove hardcoded magic number of threads in build scripts

In some buildscripts, the number of threads for concurrent builds is set to a fixed magical number:

https://github.com/jhauswald/sirius/blob/master/sirius-application/get-opencv.sh
https://github.com/jhauswald/sirius/blob/master/sirius-application/compile-sirius-servers.sh

here it is probably kind of ok, i guess travis does not do concurrent builds:
https://github.com/jhauswald/sirius/blob/master/.travis.yml

I am not sure, that 4 is always for everyone a good choice, as this number is completely system dependent. Usually you would default to non concrrent builds (-j 1) and give the user the option to set the number of threads manually or you could query nproc(1) to find out the number of cores.

where is libfst.so?

Finished building, I try to run but i get

./start-asr-server.sh
Starting server on localhost:8081, use to stop
..//src/online2bin/online2-wav-nnet2-latgen-faster: error while loading shared libraries: libfst.so.1: cannot open shared object file: No such file or directory
^CTraceback (most recent call last):
File "./start-asr-dnn-server.py", line 82, in
server.serve_forever()
File "/usr/lib/python2.7/SocketServer.py", line 236, in serve_forever
poll_interval)
File "/usr/lib/python2.7/SocketServer.py", line 155, in _eintr_retry
return func(*args)
KeyboardInterrupt

Make input only package

For those who start from the repo instead of the package online, it would be useful to have just a package with sirius-suite inputs and another with just the Wikipedia database to not have to download everything again. This covers #21 #28 #29

[sirius-suite] timing

Design framework to run each kernel X number of times and output statistics:

  • run-to-run variability
  • average run-time
  • average speedup (when applicable)
  • input size

Output in easily parse-able format (json, csv).

Do not advise users to run "./compile-sirius-servers.sh" with root privileges

The installation instructions (http://sirius.clarity-lab.org/sirius/) advise users to run various scripts with root privileges:

$ sudo ./compile-sirius-servers.sh

The problem here is, that compilers are NOT designed to be run as root and build scripts must not be designed to require root privileges, when just building the programs or libs.

"./compile-sirius-servers.sh" does not look like it requires root privileges, so I guess updating the installation instructions would be a first step.

libsdl1.2-dev will not install on Ubuntu 14.04

One of the packages installed by get-opencv.sh is libsdl1.2-dev. This fails to install on a fresh Ubuntu 14.04 Desktop. Errors below:

$ sudo apt-get install libsdl1.2-dev
Reading package lists... Done
Building dependency tree       
Reading state information... Done
Some packages could not be installed. This may mean that you have
requested an impossible situation or if you are using the unstable
distribution that some required packages have not yet been created
or been moved out of Incoming.
The following information may help to resolve the situation:

The following packages have unmet dependencies:
 libcheese-gtk23 : Depends: libclutter-gtk-1.0-0 (>= 0.91.8) but it is not going to be installed
                   Depends: libcogl15 (>= 1.15.8) but it is not going to be installed
 libcheese7 : Depends: libclutter-gst-2.0-0 (>= 0.10.0) but it is not going to be installed
              Depends: gstreamer1.0-clutter but it is not going to be installed
 libclutter-1.0-0 : Depends: libcogl-pango15 (>= 1.15.8) but it is not going to be installed
                    Depends: libcogl15 (>= 1.15.8) but it is not going to be installed
E: Error, pkgProblemResolver::Resolve generated breaks, this may be caused by held packages.

This error causes the entire command to fail, which will prevent all of those dependencies from being installed.

travis-ci for fanciness

Travis-CI is this awesome continuous integration tool that helps us to keep track of the compiling and building process.
More importantly, it will give us a cute icon in our README.md file similar to other open-source projects.
Build Status

Clarify that sirius will currently only build and run on Ubuntu

From reading over the build scripts, it is apparent, that this will only build on Ubuntu. (dependency on apt-get and adding ppas)
In the README fies, this is not as apparent and the installation says:

Sirius has been tested on Ubuntu 12.04 and 14.04.

Which could be more correctly formulated as:

Sirius requires Ubuntu 12.04 or 14.04.

Verbose install script output

Currently, the install scripts, particularly compile-sirius-servers.sh, are redirecting the output of a number of compile commands to /dev/null. This leads to very long stretches of time with no terminal output. We know we're compiling here, why not show command output on screen to give at least some indication of progress? So something like

make -j $THREADS 1>/dev/null

becomes:

make -j $THREADS

QA and Sphinx4 servers hang "insufficient number of threads..."

When running ./start-qa-server.sh, there's an issue with Jetty if the machine has a large number of threads and THREADS in ./start-qa-server.sh is smaller than the total threads, the server hangs and can't accept any requests.

Message: "...insufficient number of threads configured for SelectChanelConnector..."

Similar issue happens with ./start-asr-server.sh sphinx4.

Temporary workaround is to export THREADS=(total # threads available) although none of the services require more than a few (4 max).

libtool not found

I'm trying to build sirius on debian and I found something strange when I build kaldi tools.

After decompress tools.tar.gz, I'm trying to launch 'make' and I had this :

root@sirius:/home/sirius/sirius-application/speech-recognition/kaldi/tools# make
extras/check_dependencies.sh
extras/check_dependencies.sh: libtool is not installed.
extras/check_dependencies.sh: we recommend that you run (our best guess):
sudo apt-get install libtool

APT say it's already installed :
apt-get install libtool
libtool is already the newest version.

I looked the file "extras/check_dependencies.sh" and you use 'which' to check if packages are installed.
In my config "which libtool" doesn't return something so the script stops.

I remove "libtool" of the check list and all running yet (yes it's dirty !)

for f in make automake autoconf patch awk grep bzip2 gzip wget git; do
if ! which $f >&/dev/null; then
echo "$0: $f is not installed."
add_packages $f $f
fi
done

I don't known if it's a bug but I report-it if I can help someone

:)

Add 'sox' to the get-dependencies.sh script

After installing all components (with no problems) on Ubuntu 14.x, I get the sox command not found when trying out Kaldi. Sox did not get installed. Maybe add sudo apt-get install sox to get-dependencies.sh

opencv sources

Why git clone opencv (you say you need 2.4.9) when Debian have 2.4.9.1 in repository?

Better documentation

Unify the format of all the README.me files once the structure of the repository is settled down.
Write better documentation and copy them into the wiki page.

about performace of kaldi

when I test the ASR part of kaldi, with any example wav file, the server should take more than 10 seconds to response the answer. Is the time cost normal for a PC?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.