You can now download a dataset of 1.65 billion Reddit comments: Beware the Redditor AI

Once, our species’ greatest trove of knowledge was the Library of Alexandria. Now we have Reddit, a roiling mass of human ingenuity/douchebaggery that has recently focused on tearing itself apart like Tommy Wiseau in the legendarily awful flick ‘The Room.’ But unlike the ancient library, the fruits of Reddit’s labors, good and ill, will not be destroyed by fire. In fact, thanks to Jason Baumgartner of PushShift.io (aided by The Internet Archive), a dataset of 1.65 billion comments, stretching from October 2007 to May 2015, is now available to download. The data – pulled using Reddit’s API – is made up of JSON objects, including the…

This story continues at The Next Web

from The Next Web http://ift.tt/1fuQn94
via IFTTT
