Giter Site home page Giter Site logo

my-first-blog's Introduction

Description of projects

A.google_weather_data.py

It contains automation code for preprocessing of GLOBAL HISTORICAL CLIMATOLOGY NETWORK data downloaded from google. The Global Historical Climatology Network (GHCN) is an integrated database of climate summaries from land surface stations across the globe that have been subjected to a common suite of quality assurance reviews.

it includes code for following steps

1.Dowloading of ghcnd_all.tar.gz(it contains".dly" files for all of GHCN-Daily) file from google link into downloads folder.

2.unzipping ghcnd_all.tar.gz file in the downloads folder.

3.Taking out all the .dly file (Each ".dly" file contains data for one station and Each record in a file contains one month of daily data) starts with IN or INM from ghcnd_all which was unzipped in second step and converted them in a csv format and save into a folder named newcsv.

4.Taken all the file from newcsv folder and preprocessed them in a good csv format and save into another folder newcsv2.

5.finally, All files from newcsv2 is merged into a single file and transferred into mysql database

B.Statewise_fertiliser_data_scrapping.py

This file contains code for scrapping of statewise fertiliser data from urvarak.com. It contains explanation of each step involved in the code.

1.In first step we took the url to be scrapped and clicked on the name statewise by using xpath of name statewise. Then we moved into second window which opened after clicking.

2.In the new window opened, we clicked on the each month and each year and submit button in a loop. Each time after submitting we get report of fertiliser data.

3.Finally after getting the report we scrapped the data from the page using beautiful soup and saved it to dataframe.

C.Companywise_fertiliser_data_scrapping.py

It contains same steps as mentioned in the satewise_fertiliser_data_scrapping.py

D.Structuring_nested_json_into_csv.py

In this project we extracted important information of all market from mandi profile data and structured into a good csv format.

1.we took out area_info, admin info, general_info, state-market info, commodities_info for each market and made separate dataframe for each type of info.

2.we combined area_info, admin_info, general_info and state_market info into a single dataframe. Since each market have more than one or none commodities info, so we duplicated each record or rows in the dataframe according to number of commodities came into each market.

3.Finally, we combined above dataframe with dataframe containing commodities info to a single required dataframe.

E. Merging_excelfile.py

This file contains code for extracting all useful information from excels file and strcuturing it to a good format and then merging them all into a single file.

1.we took each file by iteration and extracted state, district, subdistrict, village from there and made a separate column for each in the table.

2.Deleted all columns with na values and all unnecessary rows with non-required info.

3.Modified each column names,then merged all the files in a single one and transferred each file into mysql database simultaneously.

F.Scrapping_of_apis.py

In this project we scrapped api link from data.gov website. Each api link contains data of different types, for instance data for Current Daily Price of Various Commodities from Various Markets (Mandi), Farmers Queries in Kisan Call Centre (KCC) from KURNOOL district of ANDHRA PRADESH for the month of April 2018 etc. in csv or xml format.

1.At first we clicked on name of each api in a loop and moved to new window opened.

2.we extracted some text from there and combined this text with http://data.gov.in/ to get a link.

3.we parsed that link using beautiful soup and extracted an api link from there which contains data in csv or xml format.

4.Finally, we created a dataFrame containing name of each api, api link and year duration taken from the name of api.

my-first-blog's People

Contributors

rajiitkgp avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.