Giter Site home page Giter Site logo

conceptsplitter's Introduction

conceptsplitter

conceptsplitter is a project designed to extract atomic concepts from a given text. It utilizes the OpenAI API to analyze and split the text into distinct core concepts, ensuring that each concept is clearly defined and elaborated upon. The project is particularly useful for extracting and organizing information from large text dumps, such as those from websites.

Features

  • Extract atomic concepts from a given text.
  • Utilizes OpenAI API for sophisticated parsing.
  • Saves extracted concepts as individual text files.
  • Provides a clear structure for analyzing and splitting text based on specific guidelines.

Directory Structure

.
├── .gitignore           # Git ignore file
├── _.env                # Environment variables (contains OpenAI API key)
├── concept_split.ppt    # Guidelines and example for text analysis
├── interface.py         # Interface to OpenAI API and core functions
├── requirements.txt     # Project dependencies
└── split_concepts.py    # Main script to process and save extracted concepts

Setup

  1. Clone the repository:

    git clone [repository_url]
    cd conceptsplitter
  2. Install the required packages:

    pip install -r requirements.txt
  3. Set up your OpenAI API key:

    • Obtain your OpenAI API key from the OpenAI platform.
    • Open the _.env file and replace your_api_key_here with your actual API key.
    • rename the file to .env (remove the underscore).

Usage

  1. Place the text files you want to process in the ~/Downloads directory. Ensure that the filenames start with the prefix text-.

  2. Run the main script:

    python split_concepts.py
  3. Extracted concepts will be saved as individual text files in the ~/desktop/atomic_notes directory.

Related Tools

Note Utilities Ecosystem: A suite of tools designed to streamline and enhance your note-taking and information processing workflows.

  • workflowlibrary - Centralizes and synchronizes the "Related Tools" section across the ecosystem.
  • noteutilsyncer - A centralized tool that automates the synchronization of the "Related Tools" section across READMEs in the noteutils ecosystem.
  • conceptsplitter - Extract atomic concepts from a given text using the OpenAI API.
  • textdownloader - A browser extension to automatically generate text dumps for processing.

Contributing

Contributions to the conceptsplitter project or the note utilities ecosystem are welcome. If you have ideas for improvements or new features, please feel free to submit issues, suggestions, or pull requests in this repository or contact me!

License

The textdownloader browser extension is open-source and licensed under the GOS License.

Credits

The conceptsplitter project is developed and maintained by Martin Christoph Frank. If you have any questions or need assistance, please contact [email protected].

conceptsplitter's People

Contributors

m-c-frank avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.