Giter Site home page Giter Site logo

Comments (2)

ssokolow avatar ssokolow commented on June 3, 2024

It's not that simple. Often, problems arise after the sanitization/validation stage based on how the components of the systems interact and, if you just strip characters like that, you wind up with Fanfiction.net converting 100% to 100 in posted chapters or various sites converting O'Henry to OHenry in form fields.

Languages already have libraries for sanitizing stuff that's truly problematic and, for the rest, you really need functional tests which feed the data through the entire process to verify that it produced the proper effects. (And blns.json is specifically intended as easy-to-use input for those case-specific tests once you've written them.)

However, what would help is harnesses like the one I wrote for testing the heuristics in https://github.com/ssokolow/game_launcher where it's one test which, on failure, prints out a chart of all of the inputs which didn't produce the expected outputs.

from big-list-of-naughty-strings.

 avatar commented on June 3, 2024

Good point... I may fork the repo and try some tests regarding this (In javascript), but I had an idea of a more or less customizable library, that can selectively filter out the categories stored in the BLNS files (e.g. Reserved Words, Numeric, etc.) so it can be customized for each use.

from big-list-of-naughty-strings.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.