Comments (2)
RWKV-LM can be used throughout the project to:
- summarize texts associated with the screenshots
- better suggest prompts (like google's autofill)
- better understand prompts from the user
- in the future: handle translations
Implementation:
- can get "infinite" context lengths through iterative refinement
- states would be the recording up until that point and the token would be the next screenshot
Limitation:
- this process can be computationally expensive
from openadapt.
As Dian mentioned above, RWKV-LM could be helpful for us since it could analyze large amounts of past user actions to better predict future actions. It could also learn how to do more complex tasks that the user themselves don't know how to do, as it could learn from longer texts like instruction manuals. These benefits would also help with efficiency and speed up the automation.
As for implementation, we could consider combining RWKV-LM with another tool that could help with the graphical side of things. For example, we could have RWKV_LM understand the task we wish to execute and lay out the steps needed, then using something like OCR or MiniGPT-4, we could handle how to execute the steps on the GUI.
from openadapt.
Related Issues (20)
- Implement VISPROG
- Implement `openadapt.adapters.groq`
- Avoid unnecessary segmentation + description in `VisualReplayStrategy` HOT 1
- Implement Instructor for structured LLM outputs
- Anthropic image descriptions out of order
- Implement Reka Core adapter
- Implement NanoLLM completion adapter
- Implement NanoLLaVA completion adapter
- Implement Autodistill
- Support winget HOT 1
- [Bug]: manual installation on Windows fails on `poetry shell`
- please make docker type installation for ease. HOT 2
- please add python3.12 support HOT 2
- please add install instructions for linux on the `https://openadapt.ai/#start` page HOT 2
- [Bug]: from matplotlib._path import ( ImportError: DLL load failed while importing _path: The specified module could not be found.
- Regarding the warning and the GUI not visible
- Implement prettier for dashboard
- Implement MiniCPM-V-2
- [Bug]: fix notification icons
- Trigger replay programmatically
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from openadapt.