I have a directory full of text files that is in a email format. Each file is of the following format.
----Email01.txt BEGIN ------
Message-ID: 84950814.1039855309681.JavaMail.ivans@hymn
Date: Sun, 30 Dec 2018 23:26:42 -0800 (PST)
From: [email protected]
To: [email protected]
Subject: ANCHORDESK: Hope ahead: What I learned
Hello…
Regards
----Email01.txt END ------
I’m trying to write scala code to create a DataFrame with these columns
Message-ID, Date, From, To, Subject
84950814.1039855309681.JavaMail.ivans@hymn, Sun, 30 Dec 2018 23:26:42 -0800 (PST), [email protected], [email protected], “ANCHORDESK: Hope ahead: What I learned”
Thanks.