Comments (3)
- vote for website content extraction.
Document loading I can solve my self by uploading files to appropriate blob container. Website content extraction needs more specification about supported formats (html, txt, json, etc). Ideally is to provide ready scraper script/function.
I my case I have access to database and can extract content via SQL queries, but supported output format is not clear yet.
from pubsec-info-assistant.
@dmitri012, the supported document formats are available at https://github.com/microsoft/PubSec-Info-Assistant/blob/main/docs/features/features.md#supported-document-types
from pubsec-info-assistant.
I'd really like to see this take priority as it is a requirement of just about every customer my team works with. Usually when we tell them this feature is not available in this repo, they use a different repo that already has this feature and they mis out on all the great work that has gone into this repo.
from pubsec-info-assistant.
Related Issues (20)
- environ vars HOT 1
- .xlsx file extension not uploading HOT 3
- Image is taking up 13GB and doesn't even work HOT 2
- With GPT-4 less than 2mb PDF file Failing - InvalidRequestError maximum context length HOT 8
- Loading module from “http://127.0.0.1:9000/src/index.tsx” was blocked because of a disallowed MIME type (“application/octet-stream”). HOT 3
- another day another bug, OPENAI_API_KEY is not read from env variable HOT 3
- embeddings HOT 1
- Not able to deploy the code HOT 2
- Unable to deploy - "No Language set, please check local.env.example for DEFAULT_LANGUAGE" HOT 1
- Upload of Json file hanging HOT 5
- Azure Gov Deployment: App Registration Redirect URI pointing to ".net" instead of ".us" HOT 1
- Gov deployment embedding error
- File still stuck on Queued HOT 2
- Information Assistant web app (rel 1.0) responses include unrelated content in thought process HOT 3
- Queued Error when uploading Json or CSV HOT 4
- Read Time Out after Batch Testing HOT 3
- separate admin and end user experience
- Deployment error for V1.1 : CredentialInvalidLifetimeAsPerAppPolicy: Credential lifetime exceeds the max value allowed as per assigned HOT 3
- Mobile ui
- Work & Work
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pubsec-info-assistant.