To bridge the gap between humanities and computer science, we introduce a new program called TALK2DATA, which allows anyone with zero programming experience to immediately start doing data science.
To use this program, all that is required is a web browser (or a Raspberry Pi). Seriously, you don’t even need a keyboard! Just have a conversation with your computer (in any language) and have it create code to do all sorts of complex tasks for you. Even better, you can see or even hear both the code and the output, and learn programming!
This is the first system, ever built, that enables full-fledged voice-based programming using natural language in over 100 languages. Made possible by the recent breakthroughs in machine learning technology, particularly deep learning based on neural networks. Powered by Google APIs, Wolfram|Alpha, and the Wolfram Language with numerous functions that use machine learning to smartly import any data format, find patterns in the data, and more!
We have preloaded our program with dozens of semantically structured datasets about Champaign-Urbana in order to encourage everyone to explore and gain more insight into issues that are relevant to our community. Crowdsourcing data analysis in this manner will raise awareness and lead to rapid innovations. New datasets can be freely added to our repository.
-
Makes everybody a data scientist: If you know how to talk or write, then you’re now a programmer and a data scientist! Freely available online for everyone.
-
Teaches programming to kids and adults: Code can be created with intuitive natural language input independent of any programming languages or paradigms. This encourages computational thinking rather than learning arbitrary syntax rules.
-
Enhances accessibility: Helps people with physical disabilities develop code and analyze data more easily. Audio based input and output, 103 supported languages.
-
Preloaded with Champaign-Urbana community data: Contains dozens of datasets related to Champaign and Urbana hosted on our web server. Anyone can start exploring and analyzing this data immediately.
-
Smart import of any data format: Our smart import, powered by the Wolfram Language, can automatically upload most types of datasets while recognizing their formatting and headers, using machine learning. This also preserves physical units within the datasets and automatically converts and combines units during later operations.
-
Error correcting natural language processing: You can type as sloppily as you want, and our text recognition and spell checking algorithms will correct your spelling for you and figure out what you intended to do.
-
More functionality added everyday: Thousands of different operations already recognized. We are working hard on adding more and more everyday.
-
High performance cloud computing: Analyze huge data sets and perform computationally intensive calculations through a browser on any device including Chromebooks, Raspberry Pis or smartphones (also works on the Wolfram Cloud app).
The voice input is recognized with one of the most accurate voice recognition system that exists today: the Google Cloud Speech-to-Text engine, which is powered by deep neural networks and is constantly improving every minute. Next, the Google Translate API is used to convert the input string (supported for over 103 languages) into English.
Then our custom text-to-code parser converts the input sentences into Wolfram Language code. This code is executed partly on our servers and partly on the Wolfram Cloud. Unrecognized requests are outsourced to Wolfram|Alpha. Finally the results are displayed on the website.
- Develop mobile apps for Android/iOS and chatbots
Data science on the go, without having to type on tiny keyboards. All computations and data sets will be hosted for free on the cloud. We also plan to integrate handwriting recognition into the app. A prototype version with limited functionality is already available on the Wolfram Cloud app for iOS and Android.
- Enable general-purpose programming (TALK2CODE)
The framework we built can be extended to allow general-purpose programming in any programming language as well as symbolic/numerical mathematical computation, using your voice. We are working on using machine learning techniques to automatically teach our system how to learn new commands.
- Natural Language Processing, Sentence Recognition
- Neural Networks, Deep Learning, Machine Learning
- Smart Data Import, Data Science, Data Analysis, Wolfram Data Repository
- Wolfram|Alpha, Wolfram Language, Wolfram Cloud, Mathematica
- Google Cloud, Speech-to-Text, Speech Recognition, Voice-based Programming
- Accessibility, Computational Thinking, Code Literacy
- Champaign, Urbana, University of Illinois, HackCulture, Hackathon
- TALK2DATA, TALK2CODE
- Cloud Computing, Mobile Programming, Mobile Data Science