Giter Site home page Giter Site logo

kamus_kbba's Introduction

kamus_kbba

Kamus Bahasa Bahasa Alay.

kamus bahasa indonesia khusus bahasa alay (slang) untuk melakukan analisis sentimen pada tahap slang word standardization.

example of how to use:

table :

content_clean
0 ko jadi ngaco ya sekarang udh harus beli tiket...
1 tingkatin performa dong dari gua jaman maba am...

code :

kbba_dictionary = pd.read_csv('https://raw.githubusercontent.com/insomniagung/kamus_kbba/main/kbba.txt', delimiter='\t', names=['slang', 'formal'], header=None, encoding='utf-8')

slang_dict = dict(zip(kbba_dictionary['slang'], kbba_dictionary['formal']))
kbba_dictionary

def convert_slangword(text):
    words = text.split()
    
    normalized_words = [slang_dict[word] if word in slang_dict else word for word in words]
    normalized_text = ' '.join(normalized_words)
    return normalized_text

df['content_clean'] = df['content_clean'].apply(convert_slangword)

df[['content_clean']]

table :

content_clean
0 kok jadi kacau iya sekarang sudah harus beli t...
1 tingkatkan performa dong dari saya zaman mahas...

terdapat penambahan kata. silakan gunakan dengan bijak.

credit and thanks to https://github.com/ramaprakoso/analisis-sentimen/blob/master/kamus/kbba.txt

kamus_kbba's People

Contributors

insomniagung avatar sadammahendra avatar firmanxyz avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.