Comments (9)
We agree and understand the difficulties. Our research division is currently working on an efficient way to use the 100B model without having to download and store all of it. We expect it to become available by the end of summer. We'll make a separate announcement to let you know when you can use it.
from yalm-100b.
Could you please give an example: which 100 billion parameters are being referred to?
Please consider providing access for third-party developers, for example via a Yandex Cloud API, rather than only a simple web page like Balaboba.
It seems that YaLM is used in Yandex Alice, which is available in the Yandex mobile app, so I think you can test it there.
@TheDanikReal Yandex Alice is a marketing product; it uses a lot of components.
Do I understand correctly that this is the same model Balaboba ran on?
Could you please give an example: which 100 billion parameters are being referred to?
- @mr-troll, please don't post off-topic comments in this thread.
- Please read up on machine learning terminology and how it works.
Do I understand correctly that this is the same model Balaboba ran on?
The Balaboba online demo also used a model from the YaLM family, but it was significantly smaller, with only 3B parameters.
It seems that YaLM is used in Yandex Alice, which is available in the Yandex mobile app, so I think you can test it there.
Yandex Alice uses smaller YaLM models (up to 3B parameters) for its open-domain conversation capabilities.