Data Engineering - Project 2

Assignment

  • Write a program that indexes .html files of scientific papers (such as those extracted in Project 1). Index at least two fields, for example title, authors, and abstract, and use an appropriate analyzer for each field.
  • Write either a program that reads a query from the console, searches the index, and prints the results, or a small web application in which queries are submitted through a form.
  • Test the system with a dozen different queries.
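
The indexing and querying steps above can be illustrated with a small sketch. It is not the assignment's reference solution: it uses only the Python standard library instead of a dedicated search library, and it assumes that each paper keeps its title in `<title>` and its abstract in `<p class="abstract">`. All names (`PaperParser`, `FieldIndex`, `parse_query`, `ANALYZERS`) are hypothetical and chosen for illustration only.

```python
import re
from collections import defaultdict
from html.parser import HTMLParser

# Assumed stopword list, used only by the abstract analyzer.
STOPWORDS = {"a", "an", "and", "of", "the", "in", "for", "to", "with"}


def simple_analyzer(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def stopword_analyzer(text):
    """Same as simple_analyzer, but drops common English stopwords."""
    return [t for t in simple_analyzer(text) if t not in STOPWORDS]


# One analyzer per field: titles keep every word, abstracts drop stopwords.
ANALYZERS = {"title": simple_analyzer, "abstract": stopword_analyzer}


class PaperParser(HTMLParser):
    """Collects text from <title> and <p class="abstract"> (assumed markup)."""

    def __init__(self):
        super().__init__()
        self.fields = {"title": [], "abstract": []}
        self._current = None      # field currently being collected
        self._closing_tag = None  # tag that ends the current field

    def handle_starttag(self, tag, attrs):
        if self._current is not None:
            return
        if tag == "title":
            self._current, self._closing_tag = "title", "title"
        elif tag == "p" and dict(attrs).get("class") == "abstract":
            self._current, self._closing_tag = "abstract", "p"

    def handle_endtag(self, tag):
        if tag == self._closing_tag:
            self._current = self._closing_tag = None

    def handle_data(self, data):
        if self._current is not None:
            self.fields[self._current].append(data)


class FieldIndex:
    """One inverted index (token -> set of document ids) per field."""

    def __init__(self):
        self.postings = {field: defaultdict(set) for field in ANALYZERS}

    def add(self, doc_id, html):
        parser = PaperParser()
        parser.feed(html)
        parser.close()
        for field, analyze in ANALYZERS.items():
            for token in analyze("".join(parser.fields[field])):
                self.postings[field][token].add(doc_id)

    def search(self, field, query):
        """Return the sorted ids of documents containing every query token."""
        tokens = ANALYZERS[field](query)
        if not tokens:
            return []
        sets = [self.postings[field].get(t, set()) for t in tokens]
        return sorted(set.intersection(*sets))


def parse_query(line, default_field="title"):
    """Split 'field:terms' into (field, terms); fall back to the default field."""
    field, sep, rest = line.partition(":")
    if sep and field.strip() in ANALYZERS:
        return field.strip(), rest.strip()
    return default_field, line.strip()
```

A console front end could read each line with `input()`, pass it through `parse_query`, and print the result of `FieldIndex.search`; a query such as `abstract:graph storage` would then search only the abstract index. The query is analyzed with the same analyzer as the field it targets, so stopwords dropped at index time are also dropped at query time.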