discorddataparser's Introduction

what do i put here KPOP ANIME FORTNITE

discorddataparser's People

Contributors

brainicism, gibstick, pastmoments, taahamahdi, virenmohindra

discorddataparser's Issues

undefined local variable or method `__dir__' for #<ResultRenderer:0x0000000160f558>

Begin parsing activity...
Progress: 1/3 (events-2016-00000-of-00001.json)
Progress: 2/3 (events-2017-00000-of-00001.json)
Progress: 3/3 (events-2018-00000-of-00001.json)
Finished parsing activity...
undefined local variable or method `__dir__' for #<ResultRenderer:0x0000000160f558>

Running this on the Windows Subsystem for Linux.

 ruby app.rb --data-path="./discord-data-2018-08-06/"

There's only index.css in output/visualizations/. Other output folders seem populated.
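
One possible cause, offered as an assumption rather than a diagnosis: `__dir__` was added in Ruby 2.0, so an older interpreter raises exactly this kind of NameError (`ruby -v` shows the version in use). A minimal sketch of a fallback that works on both old and new interpreters follows; `script_dir` is a hypothetical helper, not part of the project, and unlike `__dir__` it does not resolve symlinks.

```ruby
# Hypothetical helper: absolute directory of a source file, without relying on
# Kernel#__dir__ (which only exists from Ruby 2.0 onward).
def script_dir(file)
  File.expand_path(File.dirname(file))
end

# Inside a class such as ResultRenderer, a call to __dir__ could be written as:
#   script_dir(__FILE__)
```

Upgrading the interpreter to Ruby 2.0 or later would presumably avoid the need for the helper altogether.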

Failed while parsing messages

Progress: 303/484 (Direct Message with bxllona / tchai#6555)
No such file or directory - ./output/prettified/messages/Direct Message with bxllona / tchai#6555_387451607731666944.txt.txt

The run got past this point once I removed the entry from index.json, and @taahamahdi also got it to run after removing the entry.
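
The failing channel name contains a `/`, which the file system treats as a path separator, so that is a likely (but unconfirmed) cause. A minimal sketch of sanitizing the name before building the output path; `safe_filename` is a hypothetical helper and the set of replaced characters is an assumption:

```ruby
# Hypothetical helper: replace characters that are path separators or are
# otherwise not allowed in file names on common platforms.
def safe_filename(name)
  name.gsub(%r{[/\\:*?"<>|]}, '_')
end

channel = 'Direct Message with bxllona / tchai#6555'
path = File.join('output', 'prettified', 'messages', "#{safe_filename(channel)}.txt")
puts path
```

With this in place the entry would not need to be removed from index.json by hand.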

Output folder isn't created if it doesn't exist

No such file or directory @ rb_sysopen - ./output/messages/by_date.csv
(the same error is printed many times in a row)
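
A minimal sketch of one way to avoid this, assuming the writer can be wrapped in a small helper: create the parent directory with `FileUtils.mkdir_p` before opening the file. `write_output` is a hypothetical name, not a method from the project.

```ruby
require 'fileutils'

# Hypothetical helper: make sure the parent directory exists, then write.
def write_output(path, contents)
  FileUtils.mkdir_p(File.dirname(path)) # no-op if the directory already exists
  File.open(path, 'w') { |f| f.write(contents) }
end

# Example call:
#   write_output('./output/messages/by_date.csv', "date,count\n")
```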

Refactor null data point filling methods

This concerns fill_messages_by_date, fill_messages_by_time_of_day, and fill_messages_by_day_of_week inside message_by_date_processor.rb. Each method sorts the array twice while inserting 0 values for dates without data points. This can alternatively be done in place. The methods serve a very similar purpose and may be able to be combined into one.
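
A minimal sketch of how the three methods might collapse into one, assuming the counts are available as a Hash keyed by date, hour, or weekday (the project's actual data structure may differ). `fill_missing` is a hypothetical name. Iterating over the full, already ordered key range means no sorting is needed at all.

```ruby
require 'date'

# Hypothetical helper: given a Hash of key => count and the full list of
# expected keys (already in order), return [key, count] pairs with 0 for
# every key that has no data point.
def fill_missing(counts, all_keys)
  all_keys.map { |key| [key, counts.fetch(key, 0)] }
end

by_date = { Date.new(2018, 8, 1) => 4, Date.new(2018, 8, 3) => 1 }
filled = fill_missing(by_date, by_date.keys.min..by_date.keys.max)
filled.each { |date, count| puts "#{date},#{count}" }
```

The same helper would cover time of day with `0..23` and day of week with `0..6` as the key range.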

Add "commonly used phrases"

Instead of only looking at individual words, find occurrences of combinations of words (of varying length) in messages.

For example:
"Damn I love anime."
"I love anime."
"Do you know I love anime?"

"I love anime" would occur 3 times.

Analyze "Message by Time of Day" by the local time in the timezone each message was sent from

For anyone who feels like messing with timezones and enjoys tearing their hair out.

This should be a separate analysis from the current "Message by Time of Day".

As an example, let's say I send 5 messages.
"I love you" sent from 16:00 UTC-5 (21:00 UTC)
"Just kidding" sent from 16:00 UTC-3 (19:00 UTC)
"Heyyyy" sent from 16:00 UTC-8 (00:00 UTC)
"Just chillin wbu" sent from 19:00 UTC-5 (00:00 UTC)
"I have a bf" sent from 19:00 UTC-2 (21:00 UTC)

The "Message by Time of Day" would count 2 of (21:00 UTC), 2 of (00:00 UTC), and 1 of (19:00 UTC).

What I am suggesting is for something to count 3 of (16:00 local) and 2 of (19:00 local).
This could be done by looking at the location that the message was sent from, then figuring out the local timezone from that.

Bonus points for looking at the date of the message and figuring out daylight saving time and the many quirks of timezones.

This is relevant for co-op students and people that travel often.
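
A minimal sketch of the counting step, assuming each message already comes with a UTC timestamp and a UTC offset for where it was sent. Whether the data package contains enough information to derive that offset is an open question, and the dates below are made up to match the example. `local_hour_counts` is a hypothetical name.

```ruby
# Hypothetical helper: count messages by local hour, given [utc_time, offset]
# pairs where offset is a string such as "-05:00".
def local_hour_counts(messages)
  counts = Hash.new(0)
  messages.each do |utc_time, offset|
    counts[utc_time.getlocal(offset).hour] += 1
  end
  counts
end

messages = [
  [Time.utc(2018, 1, 1, 21, 0), '-05:00'], # "I love you"
  [Time.utc(2018, 1, 1, 19, 0), '-03:00'], # "Just kidding"
  [Time.utc(2018, 1, 2, 0, 0),  '-08:00'], # "Heyyyy"
  [Time.utc(2018, 1, 2, 0, 0),  '-05:00'], # "Just chillin wbu"
  [Time.utc(2018, 1, 1, 21, 0), '-02:00']  # "I have a bf"
]
counts = local_hour_counts(messages)
counts.each { |hour, count| puts "#{hour}:00 local: #{count}" }
```

Fixed offsets sidestep daylight saving time entirely; handling it properly would need named timezones and a timezone database, for example through the tzinfo gem.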

Not being able to recognize data-path?

Am I missing something? I cannot analyze the contents of the package after downloading it from Discord's email and unzipping it. It throws a "directory not available" error even though I can definitely cd into the directory.

[screenshot: screen shot 2018-09-02 at 2 38 46 am]

Output is in ASCII (ANSI?) instead of UTF-8

[screenshot]
This also happens when I open the prettified message files in Notepad++, until I choose "encode as UTF-8".

I'm not sure if there's metadata in these files that tells you what encoding they use. This might be related to #14.
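
The cause is not established here. One thing that might be worth ruling out is the files being written with the platform's default encoding, so the following is a minimal sketch of writing and reading with an explicit UTF-8 encoding. `write_utf8` and `read_utf8` are hypothetical helpers, not project code.

```ruby
# Hypothetical helper: write text with an explicit external encoding instead
# of relying on the platform default.
def write_utf8(path, text)
  File.open(path, 'w:UTF-8') { |f| f.write(text) }
end

# Hypothetical helper: read the file back, tagging the bytes as UTF-8.
def read_utf8(path)
  File.read(path, encoding: 'UTF-8')
end
```

Plain text files carry no encoding metadata, so an editor may still guess wrong when a UTF-8 file has no byte order mark; in that case the file itself is fine and only the editor's detection is off.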

Incorrect Output?

[screenshot]

Hi, I just checked this output for my most active threads. According to it, this one group chat is the most active. I wanted to make sure that was right and, to my shock and surprise (gasp), found that I had actually sent about 200,000 messages in one channel, but alas, that channel is nowhere to be seen in the output.

Here is a screenshot of a CSV that Discord sent me.
[screenshot]

Is there like a cap on the number of messages that can be parsed or something? Because it seems like this epic thing is being ignored.
