Comments (7)
If we can do that, why not just contribute to Brasil.IO which is open source, open data and hosting tons of datasets already? I think making them better is a win-win : )
from perfil-politico.
Hi @cuducos that would be nice too! I just don't know if this projetct should walk in parallel with brasil.io or if they should be independent from each other.
I think the community could contribute getting the data for brasil.io, but maybe the other contributors think is better to have the data in another place.
I really don't what is the best choice.
from perfil-politico.
I just don't know if this projetct should walk in parallel with brasil.io or if they should be independent from each other.
My opinion on that is that preferably they should walk in parallel. Open-source in general (at least on the level we are in Open Knowledge Brasil and in Brasil.IO) usually suffer from a lack of support and hands-on to keep things up and running. Thus, bringing data collections, cleaning, and treatment to the scope of this project mean two things:
- increasing its complexity (or, in other words, risking decreasing it maintainability)
- capturing some volunteer (or even paid) work hours that would considerably help another open-source project (namely, Brasil.IO)
I think both consequences are not what Open Knowledge and Brasil.IO aspire for.
Surely I hear you: depending on an external project might make things move slower here — and I do agree with that. But I wouldn't assume the two consequences below before hearing from @turicas, for example, that Brasil.IO, for whatever reason, cannot accommodate contributions in a timely manner.
TLDR Getting back to the opening statement, data comes from brasil.io website and they are not updated frequently, I believe we could use this stamina to help them update the data we need more often instead of increasing the complexity of this project : )
from perfil-politico.
I got it @cuducos, thanks for your considerations.
Thinking now maybe I was little bit harsh with this statement.
It wasn't my intention, I know how difficult is to work with open data specially when it depends on the goodwill of the public organizations. Besides that I imagine how hard is to get data for brasil.io specially now with the efforts aimed at covid's data.
I think you made really good points and we can help to update the data that is necessary for this project.
from perfil-politico.
If we can do that, why not just contribute to Brasil.IO which is open source, open data and hosting tons of datasets already? I think making them better is a win-win : )
Because I agree with you, I tried, really, but the onboarding process is very difficult.
In several failed attempts, during this week, I fell into a hell of unresolved dependencies. And the issue has six days without any response. turicas/brasil.io#388
from perfil-politico.
Just a side note on Brasil.IO: this is not the repository I was suggesting to work on. This is the source code of their website, a Django instance with the dashboard, API and so on. Each dataset has its own repository with the code to collect and clean data.
If we go, for example, to their Eleições Brasil datasets page, there is the URL for the repository that collects and cleans these datasets: github.com/turicas/eleicoes-brasil
.
from perfil-politico.
Excellent and necessary guidance, thank you.
from perfil-politico.
Related Issues (20)
- [Integração de dados] Implementar live-update do comando load_bills HOT 1
- [Raspador Legislativo][spike] Discutir com cuducos solução para o crawler do senado
- [Raspador Legislativo] Corrigir spider do senado HOT 7
- [Otimização][spike] Investigar otimização do comando load_affiliations
- [Otimização] Otimizar o comando load_affiliations
- [Documentação] Documentar fontes de dados
- [Integração de dados][spike] investigar novos dados de candidatos HOT 3
- [Integração de dados][spike] investigar novos dados de bens HOT 4
- [Integração de dados] Atualizar eleições brasil para extrair candidatos de 2022
- [Integração de dados] Atualizar eleições brasil para extrair bens candidatos de 2022
- [Integração de dados] Carregar dados de candidatos de 2022 no servidor HOT 1
- [Integração de dados] Carregar dados de bens de 2022 no servidor HOT 3
- [Consistência de dados] Dados de candidatos duplicados HOT 4
- [API] Adicionar campo status e round nas requests de candidatos
- [Infra] Criar ambiente de staging HOT 3
- [Integração de dados] Atualizar eleições brasil para extrair dados de filiação de 2022
- [Integração de dados][spike] investigar novos dados de filiação HOT 2
- [Integração de dados] Atualizar fotos de candidatos de 2022 HOT 1
- [Integração de dados] Carregar dados de filiados de 2022 no servidor HOT 2
- [Integração de dados] Atualizar o banco de produção com os dados de staging HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from perfil-politico.