Yahoo Just Released a Huge Machine Learning Dataset

Yahoo just released a 1.5 TB dataset of “anonymized user interactions on the news feeds”. If you have been looking for a new dataset to analyze, this might be it. It contains approximately 110 billion rows of user-news interaction data. Happy data exploring!

5 responses to “Yahoo Just Released a Huge Machine Learning Dataset”

  1. S Kirpalani

    Why do you need a university email to do any research?

  2. Society of Data Scientists

    Sometimes the *.edu address requirement can pose a problem.

    If you want datasets that do not require an academic email address, you may be interested in trying the following:

    1. Ryan Swanstrom

      Thanks for sharing.
