R/topic-analysis.R
primary_topics.Rd
primary_topics
summarizes for each topic the number of documents and
the respective mean topic share (gamma) where a topic is one of the three
primary topics in a document.
primary_topics(topicsByDocDate, minGamma = 0)
topicsByDocDate | a dataframe as returned by
|
---|---|
minGamma | the minimum share of a topic per document to be considered
when summarizing primary topic information; topics with smaller shares per
individual document will be ignored when summarizing the document counts
and mean topic shares. (In an |
a dataframe with 7 columns where:
a topic
ID as provided as an input in topicsByDocDate
number of documents where topic_id
has the largest
probability
number of documents where topic_id
has
the second largest probability
number of documents where
topic_id
has the third largest probability
mean
probability of all documents in n_docs_1
mean
probability of all documents in n_docs_2
mean
probability of all documents in n_docs_3