Comments (4)
不是单词, 算的就是句子的,lda输入的是 tf_ngram.T
self.sentences_cut = [" ".join(sc) for sc in self.sentences_cut]
# 计算每个句子的tf
vector_c = CountVectorizer(ngram_range=(1, 2), stop_words=self.stop_words)
tf_ngram = vector_c.fit_transform(self.sentences_cut)
...
res_lda_u = lda.fit_transform(***tf_ngram.T***)
from nlg-yongzhuo.
不是单词, 算的就是句子的,lda输入的是 tf_ngram.T
self.sentences_cut = [" ".join(sc) for sc in self.sentences_cut] # 计算每个句子的tf vector_c = CountVectorizer(ngram_range=(1, 2), stop_words=self.stop_words) tf_ngram = vector_c.fit_transform(self.sentences_cut) ... res_lda_u = lda.fit_transform(***tf_ngram.T***)
想问下res_lda_u和res_lda_v哪个是文档主题分布,哪个是主题-词分布
from nlg-yongzhuo.
res_lda_v是文档主题分布
from nlg-yongzhuo.
那就说的通了, 大佬是把一个句子作为lda中的"文档"这个思路抽取的吧
from nlg-yongzhuo.
Related Issues (15)
- topic_lda.py中的n_topics HOT 1
- 数据问题 HOT 2
- 意见反馈 HOT 3
- feature_base/text_teaser.py的问题 HOT 1
- 在import时出现问题:cannot import name '_get_n_jobs' HOT 2
- Lead-3 疑问 HOT 4
- ValueError: max_df corresponds to < documents than min_df HOT 1
- 语句排序的一些问题 HOT 1
- 关于模型加速的问题 HOT 1
- 怎么保留句子的原始顺序呢? HOT 2
- 使用pip安装过程中出现下列错误,请问如何解决? HOT 3
- pip 安装的问题 HOT 2
- 关于text pprocess问题 HOT 2
- 这个适配python的哪个版本啊,安装老是报错 HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from nlg-yongzhuo.