Mining Novel Data from Large Unlabeled Corpus - OpenAI Interview Question | DarkInterview