1 | initial version |
To extract a specific section of the email content, you can use Python's regular expression module - re
. You can search for a specific pattern within the email body and extract only that part.
Here's an example code that extracts the first paragraph of an email:
import win32com.client
import re
outlook = win32com.client.Dispatch("Outlook.Application").GetNamespace("MAPI")
inbox = outlook.GetDefaultFolder(6)
messages = inbox.Items
for message in messages:
subject = message.Subject
body = message.Body
# Search for the first paragraph
pattern = r'^.*?(\n{2,})'
match = re.search(pattern, body, re.DOTALL)
if match:
first_para = match.group(0)
# Transfer the first paragraph to Excel
# ...
In this code, we first import the re
module and use it to search for the first paragraph of the email body. The regex pattern '^.*?(\n{2,})'
matches any characters (.*?
) from the start of the string (^
) until it finds two or more consecutive newlines (\n{2,}
). The re.DOTALL
flag is used to allow the .
character to match newlines.
If the pattern is found, we extract the matched text (match.group(0)
) and transfer it to Excel. Note that you'll need to modify the code to suit your specific use case, such as selecting a different regex pattern to match the section of the email content you need.