Giter Site home page Giter Site logo

ryhkml / fine-tune-forge Goto Github PK

View Code? Open in Web Editor NEW
2.0 1.0 1.0 1.39 MB

JSONL generator designed to elevate the fine-tuning process of cutting-edge language models like Google's PaLM 2 and OpenAI's GPT-3.5

License: MIT License

Dockerfile 1.52% Shell 0.30% TypeScript 52.47% HTML 17.38% Less 1.48% CSS 26.84%
jsonl gpt-3 tools openai vertexai image-ocr localhost text-bison

fine-tune-forge's Introduction

FineTuneForge

FineTuneForge is a tool designed specifically for generating JSON Lines (JSONL) to facilitate the fine-tuning of AI language models like Google's PaLM 2 and OpenAI's GPT-3.5. It enables developers to easily transform text data into a JSONL format that machines can read.

Screenshot FineTuneForge Webapp

Getting Started

To get started with FineTuneForge, follow these steps:

Installation

git clone https://github.com/ryhkml/fine-tune-forge.git
cd fine-tune-forge
chmod +x ./install.sh
./install.sh

Usage

Run the JSONL generator with the following command:

npm run build

Serve server

npm run serve

Directory Structure

FineTuneForge is organized into several directories, each serving a specific purpose in the workflow of the JSONL generator. Below is an overview of these directories and their intended use:

  • DATADOC_OCR: This directory acts as a temporary storage for OCR (Optical Character Recognition) images
  • DATASET: The DATASET directory is the designated location for storing the completed dataset files. Once the JSONL files have been generated and are ready for use in fine-tuning the language models, they are placed in this directory
  • DATATMP: This directory for temporary storage of instruction content
  • tls: This directory is reserved for storing SSL/TLS certificates

Configuring SSL/TLS for HTTPS

To enable HTTPS in the application, you need to configure SSL/TLS certificates correctly.

Required Files

Before you start, ensure you have the following files placed in the tls directory:

  • fullchain.pem: This is your certificate file that contains the full chain of trust, including any intermediate certificates along with your own
  • cert-key.pem: This file contains your private key and must be kept secure. It is used to establish the encrypted connection
  • ca.crt (optional): This Certificate Authority (CA) file is used if you need to specify an external CA

If you use docker, uncomment the environment variable PROTOCOL_SERVER in docker-compose.yaml

License

This project is licensed under the MIT License - see the LICENSE file for details.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.