linecode / droidlet

This project forked from facebookresearch/fairo

A modular embodied agent architecture and platform for building embodied agents

License: MIT License

Dockerfile 0.02% Shell 0.10% Python 8.52% Jupyter Notebook 80.26% Makefile 0.01% HTML 1.54% CMake 0.02% C++ 1.20% Lua 0.08% JavaScript 8.23% CSS 0.03%

droidlet's Introduction


This repository and its tutorials and docs are still being refined and are not ready yet.

droidlet helps you rapidly build agents (real or virtual) that perform a wide variety of tasks specified by humans. The agents can use natural language, memory and humans in the loop.

droidlet is an early research project for AI researchers to explore ideas around grounded dialogue, interactive learning and human-computer interfaces.

droidlet is in active development and is fairly unstable in design, API, performance and correctness. It is not meant for any production use.

Reach out to us at [email protected] to discuss your use case or just share your thoughts!

Getting Started

You want to do one of three things:

  1. Robots: Reproduce and extend the PyRobot-based agent on real robots such as LocoBot or photo-realistic simulators such as AIHabitat.
  2. Minecraft: Reproduce and extend the Minecraft-based game agent.
  3. New Agent: Write your own agent from scratch, starting from our base_agent abstraction.

Steps for each option, in the order Robots, Minecraft, New Agent:

1. Clone the source code
git clone --recursive https://github.com/facebookresearch/droidlet.git
cd droidlet
            
2. Check system requirements

Robots:
- Linux
- Python 3 (Anaconda recommended)
- NVIDIA GPU (8GB+)
- PyRobot-compatible robot or sim
  - Habitat-sim instructions below

Minecraft:
- Linux
- Python 3 (Anaconda recommended)
- NVIDIA GPU (4GB+)
- Minecraft
  - more instructions below

New Agent:
- Linux
- Python 3 (Anaconda recommended)
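The requirements above can be checked roughly before installing anything. The following is a minimal sketch, not part of droidlet: `check_requirements` is a hypothetical helper, and it assumes that finding `nvidia-smi` on the PATH is an acceptable proxy for having an NVIDIA GPU and driver. It does not verify GPU memory, PyRobot, or Minecraft.

```python
import platform
import shutil
import sys


def check_requirements(need_gpu=True):
    """Return a list of human-readable problems; an empty list means all checks passed.

    Hypothetical helper for illustration only; droidlet does not ship it.
    """
    problems = []
    if platform.system() != "Linux":
        problems.append("expected Linux, found " + platform.system())
    if sys.version_info.major != 3:
        problems.append("expected Python 3")
    # Assumption: nvidia-smi on PATH is only a rough proxy for a usable
    # NVIDIA driver. It says nothing about the 8GB+ / 4GB+ memory requirement.
    if need_gpu and shutil.which("nvidia-smi") is None:
        problems.append("nvidia-smi not found on PATH")
    return problems


if __name__ == "__main__":
    for problem in check_requirements():
        print("WARNING:", problem)
```

For the New Agent option, which lists no GPU requirement, the same check could be called with `need_gpu=False`.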
        
3. Install dependencies

Robots:
conda create -n droidlet_env python=3.7 \
   pytorch==1.7.1 torchvision==0.8.2 \
   cudatoolkit=11.0 -c pytorch
conda activate droidlet_env
pip install -r \
    agents/locobot/requirements.txt
python setup.py develop

Minecraft:
pip install -r \
    agents/craftassist/requirements.txt

New Agent:
pip install -r requirements.txt
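After installing, it can help to confirm that the active environment matches the versions pinned in the conda command for the Robots option. The sketch below is an illustration rather than a droidlet tool: `installed_versions` and `mismatches` are hypothetical names, and the pins are copied from that command. Note that the conda package `pytorch` is imported as `torch`.

```python
import sys

# Pins copied from the conda command in step 3 (Robots option).
EXPECTED = {"python": "3.7", "torch": "1.7.1", "torchvision": "0.8.2"}


def installed_versions():
    """Collect the interpreter version and, if importable, the torch versions."""
    versions = {"python": "%d.%d" % sys.version_info[:2]}
    for name in ("torch", "torchvision"):
        try:
            module = __import__(name)
            # Drop a local build suffix such as "+cu110" before comparing.
            versions[name] = str(module.__version__).split("+")[0]
        except ImportError:
            versions[name] = None  # package not installed in this environment
    return versions


def mismatches(found, expected=EXPECTED):
    """Map each mismatching name to a (found, expected) pair."""
    return {
        name: (found.get(name), wanted)
        for name, wanted in expected.items()
        if found.get(name) != wanted
    }


if __name__ == "__main__":
    for name, (got, wanted) in mismatches(installed_versions()).items():
        print("%s: found %s, expected %s" % (name, got, wanted))
```

Run it inside the activated `droidlet_env`; no output means the three pinned versions match.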
        
4. Run the agent

Robots: Instructions for running the Locobot agent
Minecraft: Instructions for running the Craftassist agent
5. Tutorials, runnable in Google Colab (more coming soon).

The tutorials introduce the `base_agent` architecture and take you through the 4 components of an Agent
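To give a feel for what a component-based agent loop can look like, here is a toy sketch. It is not droidlet's `base_agent` code. This README does not name the four components, so the split into perception, memory, controller, and tasks is an assumption made for illustration, and `ToyMemory`, `ToyAgent`, and all method names are hypothetical.

```python
class ToyMemory:
    """Hypothetical memory component: a plain list of observations."""

    def __init__(self):
        self.records = []

    def store(self, item):
        self.records.append(item)


class ToyAgent:
    """Illustrative agent loop; not droidlet's actual base_agent class."""

    def __init__(self, commands):
        self.memory = ToyMemory()
        self.incoming = list(commands)  # stand-in for commands typed by a human
        self.tasks = []                 # tasks waiting to run
        self.done = []                  # names of finished tasks
        self._next_record = 0           # index of the first unprocessed memory record

    def perceive(self):
        # Perception: move at most one pending command into memory.
        if self.incoming:
            self.memory.store(self.incoming.pop(0))

    def controller_step(self):
        # Controller: turn every unprocessed memory record into a task.
        while self._next_record < len(self.memory.records):
            command = self.memory.records[self._next_record]
            self._next_record += 1
            self.tasks.append("do:" + command)

    def task_step(self):
        # Tasks: "execute" one task per step by marking it as done.
        if self.tasks:
            self.done.append(self.tasks.pop(0))

    def step(self):
        self.perceive()
        self.controller_step()
        self.task_step()


if __name__ == "__main__":
    agent = ToyAgent(["build a house", "come here"])
    for _ in range(3):
        agent.step()
    print(agent.done)
```

Keeping each stage in its own method means a real agent could swap in a different perception or task implementation without touching the loop in `step`. The tutorials remain the authoritative description of the actual architecture.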

6. API Documentation
7. Agent-specific API Documentation

Robots: Locobot agent API
Minecraft: CraftAssist agent API
New Agent: Not Applicable

Documentation, Tutorials and Papers

Two papers cover the design of droidlet:

  1. droidlet: modular, heterogenous, multi-modal agents covers the overall design of droidlet as an embodied AI platform that is extensible to physical robots and simulators.
  2. CraftAssist: A Framework for Dialogue-enabled Interactive Agents covers the design of the dialogue parser and the task system of an earlier version of droidlet that is specific to the game Minecraft.

Citation

If you use droidlet in your work, please cite our arXiv paper:

@misc{pratik2021droidlet,
      title={droidlet: modular, heterogenous, multi-modal agents}, 
      author={Anurag Pratik and Soumith Chintala and Kavya Srinet and Dhiraj Gandhi and Rebecca Qian and Yuxuan Sun and Ryan Drew and Sara Elkafrawy and Anoushka Tiwari and Tucker Hart and Mary Williamson and Abhinav Gupta and Arthur Szlam},
      year={2021},
      eprint={2101.10384},
      archivePrefix={arXiv},
      primaryClass={cs.RO}
}

License

droidlet is MIT licensed.

Other Links

Datasets

Download links to the datasets described in Section 6 of the Technical Whitepaper are provided here:

In the root of each tarball is a README that details the file structure contained within.
