- What does a computer have to do in order to understand a natural language sentence?
- What is ambiguity?
- Why is natural language processing (NLP) difficult for computers?
- What is bag-of-words representation? Why do modern search engines use this simple representation of text?
- What are the two modes of text information access? Which mode does a web search engine such as Google support?
- When is browsing more useful than querying to help a user find relevant information?
- Why is a text retrieval task defined as a ranking task?
- What is a retrieval model?
- What are the two assumptions made by the Probability Ranking Principle?
- What is the Vector Space Retrieval Model? How does it work?
- How do we define the dimensions of the Vector Space Model? What does “bag of words” representation mean?
- What does the retrieval function intuitively capture when we instantiate a vector space model with bag of words representation and bit representation for documents and queries?