Topic: unstructured-data
Something interesting about unstructured-data
unstructured-data,Manage unstructured and multimodal datasets!
Organization: aclai-lab
unstructured-data,Adansons Base is a data programming tool for error analysis of training results. It organizes the metadata of unstructured data and creates and organizes datasets. It makes dataset creation more effective, uses training results to help find low-quality data, and improves AI performance.
Organization: adansons
Home Page: https://adansons.wraptas.site/
unstructured-data,Enforce structured output from LLMs 100% of the time
Organization: automorphic-ai
Home Page: https://automorphic.ai
unstructured-data,Code and walkthrough to build an end-to-end content repository for unstructured data with dynamic access control.
Organization: aws-samples
unstructured-data,Programming language for symbolic computation with an unusual combination of pattern-matching features: tree patterns, associative patterns, and expressions embedded in patterns.
User: bartjongejan
unstructured-data,Building Knowledge Graphs from Unstructured Text
User: chaitjo
unstructured-data,A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency and ultra-low latency.
Organization: dingodb
Home Page: https://www.dingodb.com
unstructured-data,Embedding Studio is a framework that allows you to transform your Vector Database into a feature-rich Search Engine.
Organization: eulersearch
Home Page: https://embeddingstud.io/
unstructured-data,Extract tabular information from scanned documents (PDF to CSV)
User: floriancochard
unstructured-data,Python implementation of jordansissel's grok regular expression library
User: garyelephant
unstructured-data,How do we process data in different formats, such as docx and pdf, and generate insights to be linked with structured data in a database? This pattern helps establish relations between structured and unstructured data to generate recommendations using Watson NLU and Watson Studio.
Organization: ibm
unstructured-data,A Jupyter notebook that uses the Watson Visual Recognition and Natural Language Understanding services to enrich Facebook Analytics and uses Cognos Dashboard Embedded to explore and visualize the results in Watson Studio
Organization: ibm
Home Page: https://developer.ibm.com/patterns/discover-hidden-facebook-usage-insights/
unstructured-data,📺 Instill AI's official command line tool
Organization: instill-ai
Home Page: https://www.instill.tech
unstructured-data,⇋ A REST/gRPC server for Instill AI's data connector service
Organization: instill-ai
Home Page: https://www.instill.tech
unstructured-data,⛅ Versatile Data Pipeline (VDP) console website
Organization: instill-ai
Home Page: https://www.instill.tech
unstructured-data,🔮 Instill Core contains components for supporting Instill VDP and Instill Model
Organization: instill-ai
Home Page: https://www.instill.tech
unstructured-data,⚗️ Instill Model contains components for AI model orchestration
Organization: instill-ai
Home Page: https://www.instill.tech
unstructured-data,🔮 Instill Core is an open-source no-/low-code data, model, and pipeline orchestration platform
Organization: instill-ai
Home Page: https://www.instill.tech
unstructured-data,⇋ A REST/gRPC server for Instill Model API service
Organization: instill-ai
Home Page: https://www.instill.tech
unstructured-data,⇋ A REST/gRPC server for Instill VDP API service
Organization: instill-ai
Home Page: https://www.instill.tech
unstructured-data,This repository contains code and resources for detecting tables in various types of documents using machine learning and computer vision techniques.
User: inuwamobarak
unstructured-data,RL3 examples repository (information extraction, NER, NLP, web & text mining, etc).
User: jokruger
unstructured-data,Dynamic Kernel Matching (DKM) for Classifying Data with Non-conforming Features
User: jostmey
unstructured-data,Kodexa Python Client
Organization: kodexa-ai
Home Page: https://kodexa.ai
unstructured-data,Curate better data for LLMs
Organization: lilacai
Home Page: http://lilacml.com
unstructured-data,Gotz - Heavy duty ETL to automate data extraction from tons of HTML pages
User: maithilish
unstructured-data,ETL-Texts aims to be a simple and efficient pipeline designed for extracting, translating, cleaning, and transforming text files.
User: mazzasaverio
unstructured-data,Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
Organization: milvus-io
Home Page: https://milvus.io
unstructured-data,Web Data Frames
User: mkearney
Home Page: https://wibble.mikewk.com
unstructured-data,Business objective: the document classification solution should significantly reduce manual effort in HRM and achieve a higher level of accuracy and automation with minimal human intervention. Sample data set: resumes and financial documents.
User: moindalvs
unstructured-data,The infoZilla unstructured software engineering data mining tool. It can find and extract source code regions, patches, stack traces, enumerations and itemizations from discussion threads.
User: nicbet
unstructured-data,Interact, analyze and structure massive text, image, embedding, audio and video datasets
Organization: nomic-ai
Home Page: https://atlas.nomic.ai
unstructured-data,ACID compliant JSON document-based database engine with SQL language, APIs and GUI.
User: ntdls
Home Page: https://katzebase.com/
unstructured-data,NucliaDB, The AI Search database for RAG
Organization: nuclia
Home Page: https://docs.nuclia.dev/docs/docs/nucliadb/intro
unstructured-data,Home of the AI workforce - Multi-agent system, AI agents & tools
Organization: relevanceai
Home Page: https://sdk.relevanceai.com
unstructured-data,Interactively explore unstructured datasets from your dataframe.
User: renumics
Home Page: https://renumics.com
unstructured-data,A machine learning tool for creating training datasets quickly and easily using a smart Chrome extension
User: sachinkalsi
unstructured-data,Spark RDD transformations and actions for processing unstructured data
User: saranpal
unstructured-data,For this problem, we propose bidirectional LSTMs (Long Short-Term Memory) with a 1-D CNN layer to classify patient notes at the character level and at the word level. The 1-D CNN is employed to reduce training time. To improve performance, we also feed the network a combined embedding consisting of a pre-trained 100-dimensional word2vec word embedding trained on the Twitter ADR dataset and a character embedding generated by a Char-CNN for Named Entity Recognition.
User: sumanismcse
unstructured-data,An annotation tool designed for unstructured health data (标注工具, "annotation tool")
Organization: thu-west
unstructured-data,Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Organization: towhee-io
Home Page: https://towhee.io
unstructured-data,A curated list of resources for Document Understanding (DU) topic
User: tstanislawek
unstructured-data,💙 Unstructured Data Connectors for Haystack 2.0
User: tuanacelik
Home Page: https://haystack.deepset.ai/integrations
unstructured-data,The open-source tool for building high-quality datasets and computer vision models
Organization: voxel51
Home Page: https://fiftyone.ai
unstructured-data,Named Entity Recognition (NER) using LSTMs with Keras
User: yrnigam
unstructured-data,No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents
Organization: zipstack
Home Page: https://unstract.com
unstructured-data,Unstract's interface to LLMs, Embeddings and VectorDBs.
Organization: zipstack
Home Page: https://unstract.com
unstructured-data,A framework for writing Unstract Tools/Apps
Organization: zipstack
Home Page: https://unstract.com
unstructured-data,The RL3 Standard Library is a collection of modules accessible to an RL3 program that simplifies programming and removes the need to rewrite commonly used RL3 patterns and predicates.
Organization: zorallabs