GhostOfRobertMichels | 44 points | Nov 22 2016 09:03:05

Podesta Email Metadata Analysis - CURATED DUMP OF ALL NAMES AND EMAIL ADDRESSES

Hello r/pizzagate. My curiosity gotten the best of me, so I decided to download the raw Podesta emails and use data processing magic to see what might come of the effort.

In short, I wrote a small functional program that parses the header of every email, with the intent of extracting the To, From, and CC fields. IP and other fields are being considered for future release. Then, using the accumulated data, I filtere it, grouping the emails by contact name, sorted them, and merged any duplicate names and emails (at least as best as I could, given limited time and the horrifically dirty data).

A brief example showing John Podesta and some surrounding entries are below.

John Ost, Political
  jost@aft.org

John P
  john.podesta@gmail.com

John Patzakis
  jpatzakis@gmail.com

John Podesta
  cbelisle@americanprogressaction.org
  donate@americanprogress.org
  eberman@americanprogressaction.org
  eryn.sepp@gmail.com
  john.podesta@gmail.com
  John.Podesta@ptt.gov
  John_D_Podesta@who.eop.gov
  johnpodesta@gmail.com
  johnpodesta@hillaryclinton.com
  johnpodestatemp@outlook.com
  jp66@hillaryclinton.com
  jpodesta@americanprogress.org
  jpodesta@amprog.org
  jpodesta@centerforamericanprogress.net
  jpodesta@equitablegrowth.org
  jpodesta@hillaryclinton.com
  jpodesta@who.eop.gov
  podesta.mary@gmail.org
  podesta@americanprogress.org
  podesta@georgetown.edu
  podesta@law.georgetown.edu
  podestafam@aol.com

John Podesta -
  john.podesta@gmail.com

John Podesta - CAP (john.podesta@gmail.com)
  john.podesta@gmail.com

As you can see, the results aren't perfect. As I mentioned, the data is quite dirty. In some address books, John Podesta was entered as "John P", "John Podesta - CAP (john.podesta@gmail.com)", etc.

Also of note are strange associations e.g. "eryn.sepp@gmail.com" listed as one of Podesta's emails. In this case, Sepp's email was listed because she sometimes received emails as "John Podesta" e.g. when ordering tickets for him, using her personal email with this name. These links can actually be useful HUMINT, as they reveal relationships and interactions between targets.

Also note that I left accounts with names that failed to successfully decode, and many are at the top of the list. I did this for the sake of completeness--maybe someone will notice of these addresses despite having an associated name, and it could turn into a legitimate lead.

But man, (possible) false positives aside, that's still a lot of active email addresses for one guy. I wonder what he's hiding? If his widespread email use any indication, I wouldn't be surprised to find the man is living multiple lives.

Anyway, this could be cleaned further, but my time is limited, and I believe this will be of use in its current state. So, without further ado, here it is:

Standard Pastebin (slow)

Pasgebin raw file access (much faster)

Finally, time permitting, I may have some other things to share in the future.

Edit 1:

When searching, remember that this is going by contact name, and sometimes people opt for last then first name e.g.

Podesta John
  eryn.sepp@gmail.com
  john.podesta@gmail.com
  John_D_Podesta@who.eop.gov
  jpodesta@americanprogress.org
  podesta@law.georgetown.edu

Podesta John (john.podesta@gmail.com)
  john.podesta@gmail.com

Podesta John D.
  john.podesta@gmail.com
  jpodesta@americanprogress.org

Podesta, John
  john.podesta@gmail.com
  John_D_Podesta@who.eop.gov
  podesta@law.georgetown.edu

Podesta, John D.
  podesta@law.georgetown.edu

Podesta, John D.  MIL WHMO/WHCA (NO PSD)
  John_D_Podesta@who.eop.gov

Podesta, John D. WHMO/WHCA
  John_D_Podesta@who.eop.gov

Maybe I'll improve the name sanitization logic at some point, but for now, bear that in mind as you dig.

Edit 3:

To clear up any confusion that may arise from the seemingly mismatched names/emails, here's a moderately technical explanation. The software extracts display names from the email header metadata (e.g. the "friendly" name entered when naming a contact using your mail client), as well as the respective email. Once it has scraped all pertinent metadata, it projects it into an easily digestible list. Unless there are undiscovered implementation bugs, if you see an email listed, the connection is legitimate. However, if you come across any bugs, please PM me, and I will get them sorted as soon as I can.

What This Means And Why It Could Be Beneficial

If you see a strange association, such as a politician's email listed under another known politician's name, this is expected in some cases. Take the seemingly erroneous Podesta/eryn.sepp@gmail.com association I mentioned earlier. The cause of this is actually quite simple.

In this case, it looked like Nina Hachigian CC'ed Sepp using an account with a display name of "John Podesta". Why somebody might do that, I am not sure. In some cases, I observed innocuous activity, but I still think this worth looking at. These people shift between addresses at a rapid rate and are known to use aliases (recall DWS's shifty activities), and they clearly do so with the intention of hiding. Who knows? Maybe we'll uncover an alias that will lead to something that might bring it all down. A stretch, but food for though. And, as I mentioned, this artifacts are indicative of close relationships between trusted individuals (think Ping Pong empowerment trust). That in itself may be of value. Remain vigilant, and know that we will win this. History is on our side in ways they cannot begin to understand.

Edit 4: Listening Music

As previously mentioned, I nominate the phenomenal Johnny Cash song God's Gonna Cut You Down as background music for the hunt. I'm open to suggestions, so if you'd like to contribute any further additions, don't hesitate.

Edit 5: Unknown Emails

Emails with no associated display name as located at the bottom list under the [Unknown] header. These may be worth looking at to determine if the are alts for other well-known people. There are many emails that are currently many emails with unknown users.

permalink

Jnbntthrwy | 2 points | Nov 22 2016 15:27:17

My suggestion... Cross reference with: - Epstein's little black book - Clinton Foundation donors - Relationships to superpacs managed by David Brock

permalink

GhostOfRobertMichels | 1 points | Nov 22 2016 22:13:57

This is an excellent idea. Any other sources I can pull from?

permalink

Jnbntthrwy | 1 points | Nov 22 2016 22:20:16

Clinton Foundation Donors (official): https://www.clintonfoundation.org/contributors

permalink

Jnbntthrwy | 1 points | Nov 22 2016 22:21:15

I haven't found a plain text version of Epstein's contacts... here's a scan: https://www.documentcloud.org/documents/1508273-jeffrey-epsteins-little-black-book-redacted.html

permalink

Jnbntthrwy | 1 points | Nov 22 2016 22:23:03

Here's where it gets dicey... Brock manages around a dozen superPACs. Each one needs to be individually looked at to establish relationships. https://en.m.wikipedia.org/wiki/David_Brock

permalink

[deleted] | 1 points | Nov 22 2016 09:23:33

[removed]

permalink

GhostOfRobertMichels | 6 points | Nov 22 2016 09:26:13

I know, I mentioned this in the OP, and while it is noise in a sense, it is still of use.

Also of note are strange associations e.g. "eryn.sepp@gmail.com" listed as one of Podesta's emails. In this case, Sepp's email was listed because she sometimes received emails as "John Podesta" e.g. when ordering tickets for him, using her personal email. These links can actually be useful intel, as they reveal relationships and interactions between targets.

permalink

GhostOfRobertMichels | 1 points | Nov 22 2016 22:05:12

Further addressed in Edit 3, discussing why this happens. Eryn Sepp seems to send a lot of emails with "John Podesta" as the contact.

permalink