Giter Site home page Giter Site logo

jsonrepairsharp's Introduction

.NET License

JSOn Repair logo

JsonRepair Sharp

Jsonrepair Sharp is a near-literal translation of the TypeScript JsonRepair library, see https://github.com/josdejong/jsonrepair

The jsonrepair library is basically an extended JSON parser. It parses the provided JSON document character by character. When it encounters a non-valid JSON structures it wil look to see it it can reconstruct the intended JSON. For example, after encountering an opening bracket {, it expects a key wrapped in double quotes. When it encounters a key without quotes, or wrapped in other quote characters, it will change these to double quotes instead.

The library has many uses, such as:

  1. Convert from an a Word document
  2. Convert from objects with a JSON-like structure, such as Javascript
  3. Convert from a string containing a JSON document
  4. Convert from MongoDB output
  5. Convert from Newline Delimited JSON logs
  6. Convert from JSON dialects
  7. Convert from Truncated or corrupted JSON.

But with the advent of Language Model Models (LLMs) there is yet another use-case. LLMs are notoriously bad in consistently outputting well-formed datastructures, even as simple as JSON. Requiry-ing is expensive and time consuming. Jsonrepair comes to the rescue by repair these JSON modules files, and increasing changes of smooth processing.

The library can fix the following issues:

  • Add missing quotes around keys
  • Add missing escape characters
  • Add missing commas
  • Add missing closing brackets
  • Replace single quotes with double quotes
  • Replace special quote characters like โ€œ...โ€ with regular double quotes
  • Replace special white space characters with regular spaces
  • Replace Python constants None, True, and False with null, true, and false
  • Strip trailing commas
  • Strip comments like /* ... */ and // ...
  • Strip JSONP notation like callback({ ... })
  • Strip escape characters from an escaped string like {\"stringified\": \"content\"}
  • Strip MongoDB data types like NumberLong(2) and ISODate("2012-12-19T06:01:17.171Z")
  • Concatenate strings like "long text" + "more text on next line"
  • Turn newline delimited JSON into a valid JSON array, for example:
    { "id": 1, "name": "John" }
    { "id": 2, "name": "Sarah" }
    

In LLM mode

  • Strip characters heading the JSON opening brace and trailing the JSON closing brace.

Use

Use the original typescript version in a full-fledged application: https://jsoneditoronline.org Read the background article "How to fix JSON and validate it with ease"

Code example

// Enable throwing exceptions when JSON code can not be repaired or even understood (enabled by default)
JsonRepair.ThrowExceptions = true;

// Set context as LLM or Other. This will repair the json slightly differently. (Other by default)
JsonRepair.Context         = Other;

 try
 {
     // The following is invalid JSON: is consists of JSON contents copied from 
     // a JavaScript code base, where the keys are missing double quotes, 
     // and strings are using single quotes:
     string json = "{name: 'John'}";
     string repaired = JSONRepair.JsonRepair(json);
     Console.WriteLine(repaired);
     // Output: {"name": "John"}
 }
 catch (JSONRepairError err)
 {
     Console.WriteLine(err.Message);
     Console.WriteLine("Position: " + err.Data["Position"]);
 }

Command Line Interface (CLI)

The github archive comes with a jsonrepair cli tool, it can be used on the command line. To use, build JsonRepair-CLI.

Usage:

$ jsonrepair "inputfilename.json" {OPTIONS}

Options:

--version,   -v                       Show application version
--help,      -h                       Show help
--new,       -n "outputfilename.json" Write to new file
--overwrite, -o                       Replace the input file
--llm,       -l                       Parse in LLM mode

Example usage:

$ jsonrepair "broken.json"                         # Repair a file, output to console
$ jsonrepair "broken.json" > "repaired.json"       # Repair a file, output to command line and pipe to file
$ jsonrepair "broken.json" -n -l "repaired.json"   # Repair a file in LLM mode, output to command line and pipe to file
$ jsonrepair "broken.json" --overwrite             # Repair a file, replace the input json file

GUI

The archive also comes with a minimal GUI that shows a somewhat simplistic diff between the original and fixed JSON JSOn Repair GUI This GUI heavily leans on the awesome FastColoredTextBox library and the the diff sample in particular.

Alternatives:

Similar libraries:

Acknowledgements

Thanks go out to Jos de Jong, who not only did all the heavy lifting in the original jsonrepair for typescript library, but also patiently helped getting this port to pass all unit tests.

License

Released under the MITS license.

jsonrepairsharp's People

Contributors

thijse avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar  avatar

jsonrepairsharp's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.