NLP Assignment-9 Solution
NLP Assignment-9 Solution
Assignment- 9
TYPE OF QUESTION: MCQ
Number of questions: 8
[Question 7 and 8 carries 2 marks each] Total mark: 6 X 1 + 2 X 2 = 10
1. Topic modelling discovers the hidden themes that pervade the collection
2. Topic modelling is a generative model
3. Dirichlet hyperparameter Beta used to represent document-topic Density?
4. None of the above
Answer: 1,2
1. The Dirichlet is an exponential family distribution on the simplex positive and negative
vectors sum to one
2. Correlated Topic Model (CTM) predicts better via correlated topics
3. LDA provides better fit than CTM
4. CTM draws topic distributions from a logistic normal
Answer: 2, 4
Question 3: You have a topic model with the parameters α = 0.89 and β = 0.04. Now, if you
want to have sparser distribution over words and denser distribution over topics, what
should be the values for α and β?
1. Both α and β values should be decreased
2. Both α and β values should be increased
3. α should be decreased, but β should be increased
4. α should be increased, but β should be decreased
Answer: 4
Solution:
α : topic distribution
β : word distribution
Answer: 1,4
Question 5: Which of the following is/are True about Relational Topic Model (RTM) ?
1. RTM formulation ensures that the same latent topic assignments used to generate the
content of the documents
2. In RTM, link function models each per-pair binary variable as linear regression
3. In RTM, covariates are constructed by the Hadamard product
4. Link probability function is dependant on the topic assignments that generated their
words
Answer: 1,3,4
Question 6:
Classically, topic models are introduced in the text analysis community for________________
topic discovery in a corpus of documents.
1. Unsupervised.
2. Supervised.
3. Semi-automated.
4. None of the above.
Answer - 1. Unsupervised
Answer: 3,4
Solution: Refer Gibbs Sampling slide
(number) –number inside the brackets denote the topic no. 1 and 2 denote whether the word is
currently assigned to topics t1 and t2 respectively. η = 0.3 and α = 0.3
For question 8 calculate the value upto 4 decimal points and choose your answer
Question 8 : Using the above structure the estimated value of β(2)nature at this point is
1. 0.0240
2. 0.02459
3. 0.0260
4. 0.0234
Answer: 1
Solution:
t1 t2
machine 0 4
nature 5 0
language 5 4
vision 3 0
learning 0 3