Giter Site home page Giter Site logo

newsdiffhk's Introduction

NewsDiff

找出新聞修改的記錄

成果 http://newsdiff.g0v.ronny.tw/

以上程式碼以 BSD License 公開

程式說明

  • webdata/scripts/table-build.php
  • 建立資料表
  • webdata/scripts/crawler-new.php
  • 從所有新聞來源取得個別新聞的網址
  • webdata/scripts/crawler-one.php
  • 逐一取得新聞內容,因為執行效率不好所以不再使用
  • webdata/scripts/crawler-part.php
  • 同時執行多個程序來取得新聞內容,第二個參數是總數(分母),第一個參數是餘數。假設要同時執行兩個程序,參數就是 0 2 與 1 2
  • webdata/scripts/exporter.php
  • 打包新聞內容

執行範例

*/10 * * * * php webdata/scripts/crawler-new.php
* * * * * php webdata/scripts/crawler-part.php 0 2
* * * * * php webdata/scripts/crawler-part.php 1 2

newsdiffhk's People

Contributors

ronnywang avatar yookoala avatar kenyiu avatar kiang avatar youchenlee avatar

Stargazers

HU, Pili avatar Jack Tang avatar Mart van de Ven avatar Chunkit Ho avatar

Watchers

 avatar  avatar James Cloos avatar Chunkit Ho avatar

Forkers

yookoala

newsdiffhk's Issues

Category of articles

Use the original category from the media site, and translate/transform into general words in topics/cat.

i.e.: Politics, Social Issues...

Also a tagcloud is needed to provision how the changes going.

TVB Video News

Many news from TVB site contains solely videos, which makes the content uncaught.

The program should be able to catch the link of the video, and check if the video link has been changed.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.