Skip to main content
Meta Stack Exchange

Questions tagged [data-dump]

For questions about the quarterly Creative Commons data dumps of all public data in the Stack Exchange network Q&A sites.

Filter by
Sorted by
Tagged with
26 votes
1 answer
441 views

Over one year ago, Stack Exchange Inc. drastically changed their data dump process and, aside from making it a major pain to download the entire dump of the Stack Exchange network, decided to add the ...
81 votes
0 answers
2k views
+50

More than a year ago - during the initial discussions about the new, per site data dump system, I'd asked if I could get a full copy of the data dumps. I followed up during the public release and I ...
75 votes
6 answers
3k views

TL;DR; as I wrote this post parallel to troubleshooting, I went from thinking there was an odd bug to thinking that the company is doing something intentional (or nefarious?) with the data dump again. ...
user avatar
15 votes
1 answer
302 views

$ sha256sum webapps.stackexchange.com.7z 7af2cfa857eed56f9396261b2985b387122b28f4a7fc43efc45629b20bf488c3 webapps.stackexchange.com.7z $ 7z e -so webapps.stackexchange.com.7z Posts.xml <?xml ...
23 votes
1 answer
356 views

Attempting to access the data dump page (this is a "current" user link) throws a server error. This happens across the network, e.g. on Stack Overflow. This has been happening for 2 days now,...
-42 votes
2 answers
503 views

Feature request: remove that clause when accessing an SE data dump: You can access this site's data for personal use. For inquiries about using the data for large language model (LLM) training, ...
15 votes
0 answers
189 views

Best demonstrated by query which returns no results when run on e.g. AskUbuntu: select * from posts where posttypeid = 1 and tags like '%<.%' This should return, for example, the 1,080 questions ...
15 votes
1 answer
385 views

Following the AI-generated Answers experiment on Stack Exchange sites that volunteered to participate announcements I wanted to ask if the company has already planned a way to filter out the (few?) AI-...
28 votes
3 answers
1k views

I wanted to query for answers where the user account has been deleted, but the answer is still up. According to this link (https://stackoverflow.com/help/deleting-account), deleting your account will ...
5 votes
0 answers
168 views

In recent changes more or less linked to Shifting the data dump schedule: A proposal, I notice that we do not have anymore the Sites.xml file in Stack Exchange data dumps. The file was still present ...
Benoit74B's user avatar
  • 151
10 votes
0 answers
226 views

As seen in New Vote Types in latest data dump?, some new vote types appeared (likely) exclusively in the Stack Overflow data dump. A couple users helped find out what each one meant. However, what ...
12 votes
1 answer
221 views

Following on from my previous request for current checksums I'm looking at 2 use cases for a data dump: verifying if a current download for a data dump is correct verifying if a specific historical ...
15 votes
2 answers
3k views

Thanks to everyone who posted bug reports and feature requests related to the updated data dumps process. Below, we’ve detailed some work on those reports and requests. Issues reported on this post: ...
Berthold's user avatar
  • 3,497
9 votes
1 answer
309 views

I ran across some JSON (magnet link) containing SE data that was created after the introduction of the new data dump process. Am I allowed to publicly reshare it (e.g., on https://archive.org), or ...
21 votes
3 answers
765 views

As I have been looking through the latest StackExchange data dump, it seems like a non-compliant XML serializer was used. There are numerous escape sequences that are simply invalid XML such as &#...

15 30 50 per page
1
2 3 4 5
...
35

AltStyle によって変換されたページ (->オリジナル) /