Comments (5)
I think thats a great idee. Tried to use the collocations function with my jsonl data this weekend and thought about what output could be good. Probably it will be useful to add an option for an csv output for the results later, too.
I think it is not that bad if we work on something in parallel. I would not have understood what you did, if I hadn’t been working on the tests before. And in the end it can be good to compare solutions and pick the better option. And I think with the extended code it will be less likely, that we work on exactly the same lines.
But anyway, I will not work on the results/output question today.
from pycollocation.
Okay, I did not have much time today, but I have implemented a basic version that can still be started via test.py
for the moment (need to change that). It only prints the dataframe, but we can work on that, too. Also, I thought we should soon add more sophisticated test data. I will continue working on this tomorrow.
from pycollocation.
That is true! The great advantage of using pandas is that we can easily convert the dataframes to whatever format we like. So, instead of printing the frame we could also say to_csv()
or to_markdown()
and return the corresponding format. Yet, in this case we should rename the function since it is no longer a mere "display" function. I think what would also be important is to add some statistic measures such as log-lik soon. Maybe we can create a subfolder statistics
or something, but this should be a separate feature/issue.
from pycollocation.
Would it make sense to call it results or return_results? Then shell output could be the default and other options added.
But probably this depends on how we want the interface to the package be in the end. Do we want collections, analyses, display/results to be accessed directly, or funnel it through one command that allows to access the different functions and offers different ways of receiving the data?
from pycollocation.
Good question... I personally think it would make sense to pipe everything, maybe even offering to start the programm from a command line. I mean, we have one particular functionality. But what do you think?
from pycollocation.
Related Issues (14)
- Statistics HOT 3
- Program Structure HOT 6
- CLI HOT 1
- implementing stop words HOT 2
- Create a html GUI HOT 5
- Error when Counters are empty HOT 9
- Unit tests HOT 18
- Creating twitter adapter HOT 4
- Add tokenizer HOT 3
- Searching with wildcards? HOT 3
- Visualization/Output HOT 1
- package structure
- TODO (General List) HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pycollocation.