handle's Issues
Create default regex tests
Create tests of default regex patterns for the application that would be useful to users unfamiliar with regexes. Consider each regex by what it means to a human reader, not by what the character pattern looks like. For example, "keep contractions together", not "a word is a set of word characters plus single quotes."
Some examples:
- All Unicode "word" characters
- Hyphenation-preserving regexes
- Contraction-preserving regexes
Acceptance tests:
- Set of patterns with meaningful names
- Comprehensive suite of tests for all regexes that builds a SpacePreservingLexer with each regex and tests several cases, including edge cases.
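A minimal sketch of what "patterns with meaningful names" could look like, written in Python on the assumption that the project uses it. The names in `NAMED_PATTERNS`, the patterns themselves, and the `tokens` helper are all hypothetical; the helper stands in for SpacePreservingLexer, whose interface is not shown in this issue.

```python
import re

# Hypothetical names: each key says what the pattern means to a reader,
# not what the character pattern looks like.
NAMED_PATTERNS = {
    "unicode_words": r"\w+",
    "keep_hyphenated_words": r"\w+(?:-\w+)*",
    # Handles the straight apostrophe only; a typographic apostrophe
    # would need to be added to the pattern.
    "keep_contractions": r"\w+(?:'\w+)*",
}


def tokens(pattern, text):
    """Return every non-overlapping match of pattern in text, in order."""
    return re.findall(pattern, text)
```

A test suite could then loop over `NAMED_PATTERNS` and check ordinary and edge cases for each name, for example that a trailing hyphen or a quoted word does not end up inside a token.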
Separate SpacePreservingLexer tests into tests of basic lexing function and tests that perform actual application logic
Create a new set of tests for SpacePreservingLexer that cover its basic function: preserving the gaps between pattern matches. Move the tests that exercise specific regexes into a separate test suite.
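The basic behavior described here can be sketched independently of any particular regex. The function below is a hypothetical stand-in written in Python, not the project's SpacePreservingLexer; it illustrates the property a basic test would check, namely that the tokens and gaps together reproduce the input exactly.

```python
import re


def lex_preserving_gaps(pattern, text):
    """Split text into (kind, piece) pairs, where kind is "token" or "gap".

    Joining every piece in order reproduces the input text exactly.
    """
    pieces = []
    position = 0
    for match in re.finditer(pattern, text):
        if match.start() > position:
            # Everything between two matches is kept as a gap.
            pieces.append(("gap", text[position:match.start()]))
        if match.group():
            # Zero-length matches carry no text, so they are skipped.
            pieces.append(("token", match.group()))
        position = match.end()
    if position < len(text):
        pieces.append(("gap", text[position:]))
    return pieces
```

A basic-function test only needs a trivial pattern such as `\w+` plus the round-trip check; tests of hyphenation or contraction patterns can then live in their own suite.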
Test and decide on default regex for lexers
The default regex choice for lexers is inconsistent throughout the code. Test the candidates, choose one default, and use it consistently.
Remove punctuation in word count table
SpacePreservingLexer still includes punctuation and some other items in the word count table.
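The cause is not stated in this issue. One way to get the intended result, sketched here in Python with a hypothetical `count_words` helper, is to build the count only from text that matches the word pattern, so that whatever lies between matches never reaches the table.

```python
import re
from collections import Counter


def count_words(pattern, text):
    """Count only the pieces of text that match pattern.

    Punctuation and whitespace between matches are never counted.
    """
    return Counter(
        match.group()
        for match in re.finditer(pattern, text)
        if match.group()  # ignore zero-length matches
    )
```

Whether counts should also be case-folded is a separate decision that this sketch leaves open.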
Mark spaces and removed characters in Preview window
Mark spaces and other removed characters in Preview window.
Key requirements:
- Removed characters can include accented characters which can appear in the middle of a word. It must be clear that a character will be removed.
- This should be accessible to colorblind users.
Some ideas:
- Excluded characters can be struck through like stopwords
- Excluded characters could appear in a color that contrasts strongly with the surrounding characters, if we can find a color combination that works
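The strike-through idea can be sketched in plain text, without relying on color. The Python below is an illustration under assumptions: `mark_removed` is a hypothetical helper, the middle dot as a space marker is an arbitrary choice, and a real preview widget would more likely use its own rich-text strike-through formatting than a combining character, whose rendering depends on the font.

```python
import re

STRIKE = "\u0336"      # COMBINING LONG STROKE OVERLAY, drawn over the previous character
SPACE_MARK = "\u00b7"  # MIDDLE DOT, shown in place of a removed space


def _mark_gap(gap):
    """Mark every character of a removed span so that it stays visible."""
    marked = []
    for ch in gap:
        if ch == " ":
            marked.append(SPACE_MARK)
        elif ch.isspace():
            marked.append(ch)  # keep tabs and line breaks as they are
        else:
            marked.append(ch + STRIKE)
    return "".join(marked)


def mark_removed(pattern, text):
    """Return text with everything that pattern does not match marked as removed."""
    out = []
    position = 0
    for match in re.finditer(pattern, text):
        out.append(_mark_gap(text[position:match.start()]))
        out.append(match.group())
        position = match.end()
    out.append(_mark_gap(text[position:]))
    return "".join(out)
```

With an ASCII-only pattern such as `[a-z]+`, an accented character in the middle of a word is struck through on its own while the letters around it stay untouched, which is the case the key requirements call out.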
Preview window shows incorrectly-encoded characters on Windows
On Windows, Japanese text encoded in UTF-8 does not render correctly in the preview window.
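The issue does not identify the cause. One common source of this symptom, assuming the application is written in Python and reads documents as text, is relying on the platform default encoding, which on Windows is often a legacy code page rather than UTF-8. A minimal sketch with hypothetical helper names:

```python
from pathlib import Path


def decode_utf8(raw):
    """Decode raw document bytes as UTF-8, regardless of the platform default."""
    return raw.decode("utf-8")


def read_document(path):
    """Read a document as UTF-8 text instead of using the locale's encoding."""
    return Path(path).read_text(encoding="utf-8")
```

Decoding the same bytes with a code page such as `cp1252` produces garbled text instead of raising an error, which would match a preview that renders incorrectly rather than failing outright. If the text is already decoded correctly, the display font or widget would be the next thing to check.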
Removed documents still show in topic models
When you add and then remove a document, it will still be included in a topic model.
Steps to reproduce:
- Add a few documents to the document import list
- Remove some of them
- Create a new topic model
- Select the topic model in the list of topic models
- Click the "Docs" tab
All the documents you added in the first step are listed, including the ones you removed.
Save and load projects
Allow users to save the state of a project and load it back in. This has a few stages:
- Design a file format for saving documents and topic models
- Implement saving and loading
- Write a comprehensive suite of tests for saving and restoring projects
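As a starting point for the format discussion, here is a minimal round-trip sketch in Python. Everything in it is hypothetical: the JSON layout, the `version` field, and the function names are placeholders, since the actual file format is still to be designed.

```python
import json

FORMAT_VERSION = 1  # hypothetical version number for the saved-project layout


def dump_project(documents, topic_models):
    """Serialize project state to a JSON string."""
    state = {
        "version": FORMAT_VERSION,
        "documents": documents,
        "topic_models": topic_models,
    }
    # ensure_ascii=False keeps non-ASCII text readable in the saved file.
    return json.dumps(state, ensure_ascii=False, indent=2)


def load_project(serialized):
    """Restore (documents, topic_models) from a string made by dump_project."""
    state = json.loads(serialized)
    if state.get("version") != FORMAT_VERSION:
        raise ValueError("unsupported project format version")
    return state["documents"], state["topic_models"]
```

The core test for this stage is the round trip: loading what was just saved should return state equal to the original, including documents with non-ASCII text.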