You are given a task of:
• Analyzing a big dataset 2 TBs of contractual JSON docs organized by doc _id
• Tagging entities (companies) in the documents using entity id from a collection of
company profiles
• Building a search system that in response to the query
• Retrieves related documents ranked by their relevance
• Retrieves related companies and their products
Question: What kinds of databases you would use for this
Provide a detailed summary (> 250 words) answering the question (The summary addressed all aspects of the design to accommodate the data management task.) :
What kinds of databases you would use for this task and why? (choosing between SQL, NoSQL, MongoDB,neo4j.)