Giter Site home page Giter Site logo

big_train's People

Contributors

mintisma avatar

Watchers

 avatar  avatar

big_train's Issues

Project Review from TalkingData Data Scientists

Hi, 我是来自TD的数据科学家。
关于你的项目,有几个问题想问下:

  1. 首先是项目的功能,你写的是生成公司的风险系数,然后这个风险系数等价于情感得分,这个我觉得可以解释下具体含义,有点confusing。
  2. 根据你对计算过程的描述,我的理解是,你会先分句分词,然后根据词在句子中的tfidf,提取关键词,并根据关键词提取关键句(其实这个步骤是一种摘要提取方法),最后根据关键句构建词集,然后根据每个词的情感得分,计算出总得分。
    我想问的是:你觉得这样用tfidf算出来的分数合理吗,为什么要筛选出关键句,直接用整个文本为什么不行
  3. 我看你用了很多的processpoolexecutor,你知道processpoolexecutor和threadpoolexecutor的区别吗

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.