jakartaresearch / maleo
Wrapper library for text cleansing and preprocessing in NLP
Home Page: https://jakartaresearch.github.io/maleo/
License: MIT License
So instead of removing the numbers, it is better to encode them, e.g. 123 > xxx
or a phone number 0123 4567 8901 >
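A minimal sketch of this idea, assuming that "encode" means replacing each digit with the letter x, as in the 123 > xxx example. The function name `encode_digits` is hypothetical and not part of maleo. The target format for phone numbers is not given above, so the sketch only masks digits.

```python
import re


def encode_digits(text: str, mask: str = "x") -> str:
    """Replace every ASCII digit with `mask`, leaving other characters untouched."""
    # A callable replacement avoids any backslash-escape handling of `mask`.
    return re.sub(r"[0-9]", lambda _match: mask, text)


print(encode_digits("harga 123 ribu"))   # digits masked, words kept
print(encode_digits("0123 4567 8901"))   # spacing of the phone number is preserved
```

Masking keeps the token count and the position of numbers in the sentence, which plain removal would lose.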
Use an Indonesian stopword dictionary; it must be prepared first.
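A rough sketch of stopword removal with a prepared dictionary, assuming the dictionary is a plain list with one word per line. The names `load_stopwords` and `remove_stopwords`, the file name in the comment, and the four sample words are all hypothetical and are not taken from maleo.

```python
from typing import Iterable, Set


def load_stopwords(lines: Iterable[str]) -> Set[str]:
    """Build a lowercase stopword set from an iterable of lines (one word per line)."""
    return {line.strip().lower() for line in lines if line.strip()}


def remove_stopwords(text: str, stopwords: Set[str]) -> str:
    """Drop whitespace-separated tokens whose lowercase form is a stopword."""
    return " ".join(tok for tok in text.split() if tok.lower() not in stopwords)


# Tiny illustrative list. A prepared dictionary would normally be read from a
# file, e.g. load_stopwords(open("stopwords_id.txt", encoding="utf-8"))
# (hypothetical file name).
STOPWORDS = load_stopwords(["yang", "di", "dan", "untuk"])
print(remove_stopwords("buku yang ada di meja", STOPWORDS))
```

Keeping the dictionary as a set makes each membership check constant time, which matters once the list grows to several hundred words.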
INPUT = [emoji]
OUTPUT = >EMOJI<
When using emoji_to_word, the converted emoji names have no whitespace separating them.
Input:
"untuk besok komen disini ya.. 😊😊😊😊🙏🙏🙏 https://t.cowok/nxeojVug3z "
Current output:
utk besok komen disini ya.. "smiling_face_with_smiling_eyessmiling_face_with_smiling_eyessmiling_face_with_smiling_eyessmiling_face_with_smiling_eyesfolded_handsfolded_handsfolded_hands https://t.co/nxeojVug3z"
Expected output:
"smiling_face_with_smiling_eyes smiling_face_with_smiling_eyes smiling_face_with_smiling_eyes smiling_face_with_smiling_eyes folded_hands folded_hands folded_hands https://t.co/nxeojVug3z"
That way, each emoji word can be tokenized separately.
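One possible way to get the whitespace-separated behaviour is sketched below. It is not maleo's emoji_to_word implementation: the function name `emoji_to_word_spaced` and the two-entry `EMOJI_NAMES` table are hypothetical stand-ins for a full emoji mapping.

```python
import re

# Hypothetical two-entry mapping for illustration only; a real converter
# would cover the full emoji set.
EMOJI_NAMES = {
    "\U0001F60A": "smiling_face_with_smiling_eyes",
    "\U0001F64F": "folded_hands",
}


def emoji_to_word_spaced(text: str) -> str:
    """Replace each known emoji with its name padded by spaces, then collapse whitespace."""
    parts = []
    for ch in text:
        # Pad every converted name so adjacent emoji never fuse into one token.
        parts.append(f" {EMOJI_NAMES[ch]} " if ch in EMOJI_NAMES else ch)
    # Collapsing also normalises any other runs of whitespace in the text.
    return re.sub(r"\s+", " ", "".join(parts)).strip()


sample = "ya.. \U0001F60A\U0001F60A\U0001F64F https://t.co/nxeojVug3z"
print(emoji_to_word_spaced(sample))
```

After this step a plain `str.split()` yields one token per emoji name.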
Add the Beautiful Soup library:
pip install beautifulsoup4
Explore tools for creating documentation
Option:
Chosen --> pdoc
We currently have one slang-to-formal dictionary, but it needs to be checked and updated.
Add an on/off toggle for number removal (remove numbers: yes or no).
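Such a toggle could look roughly like the sketch below, assuming it is exposed as a boolean keyword argument. The function name `clean_text` and the parameter name `remove_numbers` are hypothetical and may not match maleo's actual interface.

```python
import re


def clean_text(text: str, remove_numbers: bool = True) -> str:
    """Optionally strip runs of digits, then normalise whitespace."""
    if remove_numbers:
        text = re.sub(r"\d+", "", text)
    # Re-join on single spaces so removed numbers leave no double gaps.
    return " ".join(text.split())


print(clean_text("promo 50 persen"))                        # numbers removed
print(clean_text("promo 50 persen", remove_numbers=False))  # numbers kept
```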
POS Tagging model
https://github.com/jakartaresearch/maleo/releases/tag/v0.0.6.2