The third round of the Yelp Dataset Challenge is now open, giving students access to our massive Phoenix Academic Dataset, with reviews and businesses from the greater Phoenix metro area. This round runs through July 31, 2014 (23:59:59 UTC). Use our data and submit your project for a chance to win a $5,000 award! Just show us how you use Yelp data in insightful, unique, and compelling ways. A team of our data mining engineers and data scientists will read each paper and select the winner(s) (we are also hiring for both roles!).
We are also excited to announce that we have updated and extended the original Phoenix Academic Dataset! The original dataset, released in March 2013, has been well-received by the academic community, and has already been cited in papers and included in presentations around the world. For more information on past winners and their papers and to download the dataset, please check out the Yelp Dataset Challenge site.
The new dataset builds upon this foundation by not only refreshing it with new content created over the past year, but also including new data like business attributes, the social graph and tips.
The new Phoenix Academic Dataset (download) incorporates the following updates and new data types:
Businesses - 15,585 (+4,048 new businesses!)
Business Attributes - 111,561 (new!)
Check-in Sets - 11,434 (+3,152 new check-in sets!)
Tips - 113,993 (new!)
Users - 70,817 (+26,944 new users!)
User Connections - 151,516 (new!)
Reviews - 335,022 (+105,115 new reviews!)
This new data is available for immediate download at www.yelp.com/dataset_challenge and replaces the previous Phoenix Academic Dataset. We are eagerly anticipating seeing the projects and research that will be built using this data. We are especially excited to see the research related to the new content: from micropost analysis on tips to inferring business attributes from reviews to mining the rich social graph for insights. We look forward to what you come up with!
For winning research, there are additional awards available for publishing your work and presenting at industry or academic conferences.
For full details and to access the data, go to our website: www.yelp.com/dataset_challenge