Python-以纯文本形式从邮件中提取正文

Python-以纯文本形式从邮件中提取正文,python,gmail,gmail-api,Python,Gmail,Gmail Api,我只想提取消息的正文并返回它。 我可以过滤字段并显示代码段,但不能显示正文 def GetMimeMessage(service, user_id, msg_id): try: message = service.users().messages().get(userId=user_id, id=msg_id, format='raw').execute() print 'Message snippet: %s' % message['sn

我只想提取消息的正文并返回它。 我可以过滤字段并显示代码段,但不能显示正文

def GetMimeMessage(service, user_id, msg_id):
    try:
            message = service.users().messages().get(userId=user_id, id=msg_id, format='raw').execute()
            print 'Message snippet: %s' % message['snippet']
            msg_str = base64.urlsafe_b64decode(message['raw'].encode('ASCII'))
            mime_msg = email.message_from_string(msg_str)
            return mime_msg
    except errors.HttpError, error:
            print 'An error occurred: %s' % error
试试这个:

mail
param是您的
mime\u msg
变量

def get_mpart(mail):
    maintype = mail.get_content_maintype()
    if maintype == 'multipart':
        for part in mail.get_payload():
            # This includes mail body AND text file attachments.
            if part.get_content_maintype() == 'text':
                return part.get_payload()
        # No text at all. This is also happens
        return ""
    elif maintype == 'text':
        return mail.get_payload()


def get_mail_body(mail):
    """
    There is no 'body' tag in mail, so separate function.
    :param mail: Message object
    :return: Body content
    """
    body = ""
    if mail.is_multipart():
        # This does not work.
        # for part in mail.get_payload():
        #    body += part.get_payload()
        body = get_mpart(mail)
    else:
        body = mail.get_payload()
    return body

base64url编码字符串在传递到解码函数之前需要进行一些更改,如下所示:

msg_str = base64.urlsafe_b64decode(message['raw'].replace('-_', '+/').encode('ASCII'))

看看这是否有帮助,谢谢。因此,经过一些修改后,解决方案如下:

def GetMessageBody(service, user_id, msg_id):
    try:
            message = service.users().messages().get(userId=user_id, id=msg_id, format='raw').execute()
            msg_str = base64.urlsafe_b64decode(message['raw'].encode('ASCII'))
            mime_msg = email.message_from_string(msg_str)
            messageMainType = mime_msg.get_content_maintype()
            if messageMainType == 'multipart':
                    for part in mime_msg.get_payload():
                            if part.get_content_maintype() == 'text':
                                    return part.get_payload()
                    return ""
            elif messageMainType == 'text':
                    return mime_msg.get_payload()
    except errors.HttpError, error:
            print 'An error occurred: %s' % error

我今天刚收到一封电子邮件,其中一个
多部分
嵌套在另一个
多部分
中,但失败了。现在尝试其他方法,这似乎是一个很有希望的类型错误:初始值必须是str或None,而不是bytestry email。message_from_bytes()感谢message_from_bytes()option@FabianBosler我收到了相同的错误。