This is a collection of data culled from the Metafilter database, for explorin' and crunchin' and statistifyin'.
If the data is significantly out of date, poke cortex and he'll refresh it. Something specific you'd like to see here? Again, let cortex know.
For more information about the format and content of these files, as well as the history of the project and links to analyses performed using the data, please see the Metafilter Wiki page about the Infodump.
New as of 12/13/09:
- Comment length files, showing length in characters of every comment.
- Metatalk thread closure information, in the deleted column of postdata_meta.
- Approximate contact creation date info, as a new column in the contact stats file.
- Userid munging, replacing userids (from folks requesting such) with unique, non-plausible ids.
Please see the wiki for details.
These files list vital stats (postid, userid, datestamp, category, comment count, favorites count, deletion status & reason) for posts to each of the major subsites. Category codes vary for askme, meta and music, and are dummy 0 values for mefi. For more detail, see the wiki.
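As a rough sketch of crunchin' the post data, the snippet below counts non-deleted posts per userid. It assumes the file is tab-delimited with a header row, that the columns are named as in the sample string, and that a deletion status of 0 means "not deleted"; none of that is confirmed here, so check the wiki for the real layout before relying on it.

```python
import csv
import io
from collections import Counter

# Hypothetical sample rows. The column names, the tab delimiter, and the
# meaning of deleted == "0" are assumptions made for illustration only.
SAMPLE_POSTDATA = (
    "postid\tuserid\tdatestamp\tcategory\tcomments\tfavorites\tdeleted\treason\n"
    "101\t1\t2009-12-01 10:00:00\t0\t12\t3\t0\t\n"
    "102\t2\t2009-12-02 11:30:00\t0\t4\t0\t1\tdouble post\n"
    "103\t1\t2009-12-03 09:15:00\t0\t30\t8\t0\t\n"
)


def live_posts_per_user(text):
    """Count posts per userid, skipping rows whose deleted column is not "0"."""
    reader = csv.DictReader(io.StringIO(text), delimiter="\t")
    counts = Counter()
    for row in reader:
        if row["deleted"] == "0":
            counts[row["userid"]] += 1
    return counts


counts = live_posts_per_user(SAMPLE_POSTDATA)
print(counts)
```

To run this against a real file, pass the contents of an opened file instead of the sample string, after adjusting the column names to match.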
These files list postid and title text for posts.
These files list vital stats (commentid, postid, userid, datestamp, favorite count, best answer) for comments on each of the major subsites. Note that the "best answer" values are dummy 0 values for all but the askme file.
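A minimal sketch of working with the comment data: total favorites received per userid, plus the commentids flagged as best answers. The tab delimiter, the column names `faves` and `best_answer`, and the use of 1 to mark a best answer are all assumptions for illustration; the wiki has the actual details.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample rows; column names and the best-answer flag value
# (1 = marked best answer) are assumed, not taken from the real files.
SAMPLE_COMMENTDATA = (
    "commentid\tpostid\tuserid\tdatestamp\tfaves\tbest_answer\n"
    "9001\t101\t1\t2009-12-01 10:05:00\t2\t0\n"
    "9002\t101\t2\t2009-12-01 10:09:00\t0\t1\n"
    "9003\t103\t1\t2009-12-03 09:40:00\t5\t0\n"
)


def favorites_by_user(text):
    """Sum the per-comment favorite counts for each userid."""
    reader = csv.DictReader(io.StringIO(text), delimiter="\t")
    totals = defaultdict(int)
    for row in reader:
        totals[row["userid"]] += int(row["faves"])
    return dict(totals)


def best_answer_ids(text):
    """Return commentids whose best_answer column is "1"."""
    reader = csv.DictReader(io.StringIO(text), delimiter="\t")
    return [row["commentid"] for row in reader if row["best_answer"] == "1"]


totals = favorites_by_user(SAMPLE_COMMENTDATA)
best = best_answer_ids(SAMPLE_COMMENTDATA)
print(totals, best)
```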
These files list commentid and comment length in characters.
These files contain vital stats (tagid, linkid, date, tagname) for tags attached to posts on mefi, askme, and music. Note that tag creation date is approximate in some cases; for more information, see the wiki.
This file contains vital stats (faveid, faver, favee, favetype, target, parent, datestamp) for every favorite on record. For an explanation of the favetype codes and the target/parent values, see the wiki.
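One way to start statistifyin' the favorites file is to tally who receives favorites and which faver/favee pairs recur. The sketch below assumes a tab-delimited file with a header row using the field names listed above; the favetype, target, and parent values in the sample are placeholders and do not reflect the real codes, which are explained on the wiki.

```python
import csv
import io
from collections import Counter

# Hypothetical sample rows. The favetype/target/parent values are
# placeholders only; the real codes are documented on the wiki.
SAMPLE_FAVORITES = (
    "faveid\tfaver\tfavee\tfavetype\ttarget\tparent\tdatestamp\n"
    "1\t10\t20\t1\t101\t0\t2009-12-01 10:00:00\n"
    "2\t10\t20\t2\t9001\t101\t2009-12-01 10:20:00\n"
    "3\t30\t20\t2\t9002\t101\t2009-12-01 10:25:00\n"
    "4\t20\t10\t1\t103\t0\t2009-12-03 12:00:00\n"
)


def tally_favorites(text):
    """Return (favorites received per favee, favorites per faver/favee pair)."""
    reader = csv.DictReader(io.StringIO(text), delimiter="\t")
    received = Counter()
    pairs = Counter()
    for row in reader:
        received[row["favee"]] += 1
        pairs[(row["faver"], row["favee"])] += 1
    return received, pairs


received, pairs = tally_favorites(SAMPLE_FAVORITES)
print(received.most_common(1))
print(pairs.most_common(1))
```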
This file contains a list of contacter, contactee, and (in some cases approximate) date of creation of mefi contact relationships.
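A small sketch of one thing the contact data makes possible: finding mutual contacts, meaning pairs where each user has added the other. It assumes each row records a one-directional relationship, that the file is tab-delimited with a header row, and that the columns are named `contacter`, `contactee`, and `date`; treat all of these as guesses until checked against the wiki.

```python
import csv
import io

# Hypothetical sample rows; the column names and the one-directional
# reading of each row are assumptions for illustration.
SAMPLE_CONTACTS = (
    "contacter\tcontactee\tdate\n"
    "1\t2\t2008-01-05\n"
    "2\t1\t2008-01-06\n"
    "1\t3\t2008-02-10\n"
    "3\t4\t2008-03-01\n"
    "4\t3\t2008-03-02\n"
    "2\t3\t2008-04-15\n"
)


def mutual_contacts(text):
    """Return sorted (low_id, high_id) pairs where both directions exist."""
    reader = csv.DictReader(io.StringIO(text), delimiter="\t")
    edges = {(int(row["contacter"]), int(row["contactee"])) for row in reader}
    # Keep each mutual pair once by requiring a < b.
    return sorted((a, b) for (a, b) in edges if a < b and (b, a) in edges)


mutual = mutual_contacts(SAMPLE_CONTACTS)
print(mutual)
```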
This file contains a list of userid and username entries for all active users of the site.
This file includes all of the infodump files above in a single zip archive.