GhostOfRobertMichels | 44 points
Podesta Email Metadata Analysis - CURATED DUMP OF ALL NAMES AND EMAIL ADDRESSES
Hello r/pizzagate. My curiosity gotten the best of me, so I decided to download the raw Podesta emails and use data processing magic to see what might come of the effort.
In short, I wrote a small functional program that parses the header of every email, with the intent of extracting the To, From, and CC fields. IP and other fields are being considered for future release. Then, using the accumulated data, I filtere it, grouping the emails by contact name, sorted them, and merged any duplicate names and emails (at least as best as I could, given limited time and the horrifically dirty data).
A brief example showing John Podesta and some surrounding entries are below.
John Ost, Political
jost@aft.org
John P
john.podesta@gmail.com
John Patzakis
jpatzakis@gmail.com
John Podesta
cbelisle@americanprogressaction.org
donate@americanprogress.org
eberman@americanprogressaction.org
eryn.sepp@gmail.com
john.podesta@gmail.com
John.Podesta@ptt.gov
John_D_Podesta@who.eop.gov
johnpodesta@gmail.com
johnpodesta@hillaryclinton.com
johnpodestatemp@outlook.com
jp66@hillaryclinton.com
jpodesta@americanprogress.org
jpodesta@amprog.org
jpodesta@centerforamericanprogress.net
jpodesta@equitablegrowth.org
jpodesta@hillaryclinton.com
jpodesta@who.eop.gov
podesta.mary@gmail.org
podesta@americanprogress.org
podesta@georgetown.edu
podesta@law.georgetown.edu
podestafam@aol.com
John Podesta -
john.podesta@gmail.com
John Podesta - CAP (john.podesta@gmail.com)
john.podesta@gmail.com
As you can see, the results aren't perfect. As I mentioned, the data is quite dirty. In some address books, John Podesta was entered as "John P", "John Podesta - CAP (john.podesta@gmail.com)", etc.
Also of note are strange associations e.g. "eryn.sepp@gmail.com" listed as one of Podesta's emails. In this case, Sepp's email was listed because she sometimes received emails as "John Podesta" e.g. when ordering tickets for him, using her personal email with this name. These links can actually be useful HUMINT, as they reveal relationships and interactions between targets.
Also note that I left accounts with names that failed to successfully decode, and many are at the top of the list. I did this for the sake of completeness--maybe someone will notice of these addresses despite having an associated name, and it could turn into a legitimate lead.
But man, (possible) false positives aside, that's still a lot of active email addresses for one guy. I wonder what he's hiding? If his widespread email use any indication, I wouldn't be surprised to find the man is living multiple lives.
Anyway, this could be cleaned further, but my time is limited, and I believe this will be of use in its current state. So, without further ado, here it is:
Pasgebin raw file access (much faster)
Finally, time permitting, I may have some other things to share in the future.
When searching, remember that this is going by contact name, and sometimes people opt for last then first name e.g.
Podesta John
eryn.sepp@gmail.com
john.podesta@gmail.com
John_D_Podesta@who.eop.gov
jpodesta@americanprogress.org
podesta@law.georgetown.edu
Podesta John (john.podesta@gmail.com)
john.podesta@gmail.com
Podesta John D.
john.podesta@gmail.com
jpodesta@americanprogress.org
Podesta, John
john.podesta@gmail.com
John_D_Podesta@who.eop.gov
podesta@law.georgetown.edu
Podesta, John D.
podesta@law.georgetown.edu
Podesta, John D. MIL WHMO/WHCA (NO PSD)
John_D_Podesta@who.eop.gov
Podesta, John D. WHMO/WHCA
John_D_Podesta@who.eop.gov
Maybe I'll improve the name sanitization logic at some point, but for now, bear that in mind as you dig.
To clear up any confusion that may arise from the seemingly mismatched names/emails, here's a moderately technical explanation. The software extracts display names from the email header metadata (e.g. the "friendly" name entered when naming a contact using your mail client), as well as the respective email. Once it has scraped all pertinent metadata, it projects it into an easily digestible list. Unless there are undiscovered implementation bugs, if you see an email listed, the connection is legitimate. However, if you come across any bugs, please PM me, and I will get them sorted as soon as I can.
If you see a strange association, such as a politician's email listed under another known politician's name, this is expected in some cases. Take the seemingly erroneous Podesta/eryn.sepp@gmail.com association I mentioned earlier. The cause of this is actually quite simple.
Navigate to emailid/25870 .
Select the WikiLeaks View source tab to the right of the View email tab, and then using your browser's page search capabilities, look for eryn.sepp@gmail.com . The matches should quickly demonstrate the cause of the observed behavior:
From: Nina Hachigian nhachigian@americanprogress.org To: John Podesta john.podesta@gmail.com CC: John Podesta eryn.sepp@gmail.com Subject: my news Thread-Topic: my news
In this case, it looked like Nina Hachigian CC'ed Sepp using an account with a display name of "John Podesta". Why somebody might do that, I am not sure. In some cases, I observed innocuous activity, but I still think this worth looking at. These people shift between addresses at a rapid rate and are known to use aliases (recall DWS's shifty activities), and they clearly do so with the intention of hiding. Who knows? Maybe we'll uncover an alias that will lead to something that might bring it all down. A stretch, but food for though. And, as I mentioned, this artifacts are indicative of close relationships between trusted individuals (think Ping Pong empowerment trust). That in itself may be of value. Remain vigilant, and know that we will win this. History is on our side in ways they cannot begin to understand.
As previously mentioned, I nominate the phenomenal Johnny Cash song God's Gonna Cut You Down as background music for the hunt. I'm open to suggestions, so if you'd like to contribute any further additions, don't hesitate.
Emails with no associated display name as located at the bottom list under the [Unknown] header. These may be worth looking at to determine if the are alts for other well-known people. There are many emails that are currently many emails with unknown users.
[deleted] | 1 points
[removed]
GhostOfRobertMichels | 6 points
I know, I mentioned this in the OP, and while it is noise in a sense, it is still of use.
Also of note are strange associations e.g. "eryn.sepp@gmail.com" listed as one of Podesta's emails. In this case, Sepp's email was listed because she sometimes received emails as "John Podesta" e.g. when ordering tickets for him, using her personal email. These links can actually be useful intel, as they reveal relationships and interactions between targets.
GhostOfRobertMichels | 1 points
Further addressed in Edit 3, discussing why this happens. Eryn Sepp seems to send a lot of emails with "John Podesta" as the contact.
Jnbntthrwy | 2 points | Nov 22 2016 15:27:17
My suggestion... Cross reference with: - Epstein's little black book - Clinton Foundation donors - Relationships to superpacs managed by David Brock
permalink
GhostOfRobertMichels | 1 points | Nov 22 2016 22:13:57
This is an excellent idea. Any other sources I can pull from?
permalink
Jnbntthrwy | 1 points | Nov 22 2016 22:20:16
Clinton Foundation Donors (official): https://www.clintonfoundation.org/contributors
permalink
Jnbntthrwy | 1 points | Nov 22 2016 22:21:15
I haven't found a plain text version of Epstein's contacts... here's a scan: https://www.documentcloud.org/documents/1508273-jeffrey-epsteins-little-black-book-redacted.html
permalink
Jnbntthrwy | 1 points | Nov 22 2016 22:23:03
Here's where it gets dicey... Brock manages around a dozen superPACs. Each one needs to be individually looked at to establish relationships. https://en.m.wikipedia.org/wiki/David_Brock
permalink