Warning: file_get_contents(/data/phpspider/zhask/data//catemap/2/python/306.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python 整理一系列的尝试。。。除了声明?_Python_Json_Error Handling - Fatal编程技术网

Python 整理一系列的尝试。。。除了声明?

Python 整理一系列的尝试。。。除了声明?,python,json,error-handling,Python,Json,Error Handling,我已经编写了一个脚本,通过管道将数据从Kustomer API传输到我们的数据库,虽然它工作得很好,但有点混乱,我想知道是否有更优雅的解决方案。我将推送的结果行定义为一个字典,然后推送到MySQL,但当JSON中的某些值不始终可用时,就会出现混乱的部分 这导致每个数据点都会出现try/except语句,这些语句可能会丢失,也可能不会丢失 有更好的方法吗?代码如下 try: record_data = { 'id': record['id'],

我已经编写了一个脚本,通过管道将数据从Kustomer API传输到我们的数据库,虽然它工作得很好,但有点混乱,我想知道是否有更优雅的解决方案。我将推送的结果行定义为一个字典,然后推送到MySQL,但当JSON中的某些值不始终可用时,就会出现混乱的部分

这导致每个数据点都会出现try/except语句,这些语句可能会丢失,也可能不会丢失

有更好的方法吗?代码如下

    try:
        record_data = {
            'id': record['id'],
            'created_at': str(datetime.strptime(record['attributes']['createdAt'], '%Y-%m-%dT%H:%M:%S.%fZ'))[:-7],
            'last_activity_at': str(datetime.strptime(record['attributes']['lastActivityAt'], '%Y-%m-%dT%H:%M:%S.%fZ'))[:-7],
            'first_marked_done': None,
            'last_marked_done': None,
            'assigned_team': record['attributes']['assignedTeams'][0] if record['attributes']['assignedTeams'] != [] else None,
            'conversation_type': None,
            'conversation_category': None,
            'conversation_subcategory': None,
            'message_count': record['attributes']['messageCount'],
            'note_count': record['attributes']['noteCount'],
            'satisfaction': record['attributes']['satisfaction'],
            'status': None,
            'email': 1 if len(list(filter(lambda x: x == 'email', record['attributes']['channels']))) > 0 else 0,
            'chat': 1 if len(list(filter(lambda x: x == 'chat', record['attributes']['channels']))) > 0 else 0,
            'priority': record['attributes']['priority'],
            'direction': 'outbound' if record['attributes']['direction'] == 'out' else 'in',
            'nlp_score': None,
            'nlp_sentiment': None,
            'waiting_for': None,
            'sla_breach': None,
            'sla_status': None,
            'breached_sla': None,
            'breached_at': None
        }
        try:
            record_data['status'] = record['attributes']['status']
        except KeyError:
            pass
        try:
            record_data['conversation_type'] = record['attributes']['custom']['typeStr']
            record_data['conversation_category'] = str(record['attributes']['custom']['categoryTree']).split('.')[0]
            record_data['conversation_subcategory'] = str(record['attributes']['custom']['categoryTree']).split('.')[1] if len(str(record['attributes']['custom']['categoryTree']).split('.')) > 1 else None
        except KeyError:
            pass
        try:
            record_data['waiting_for'] = record['attributes']['custom']['typeStr']
        except KeyError:
            pass
        try:
            record_data['first_marked_done'] = str(datetime.strptime(record['attributes']['firstDone']['createdAt'], '%Y-%m-%dT%H:%M:%S.%fZ'))[:-7]
            record_data['last_marked_done'] = str(datetime.strptime(record['attributes']['lastDone']['createdAt'], '%Y-%m-%dT%H:%M:%S.%fZ'))[:-7]

        except KeyError:
            pass
        try:
            record_data['sla_breach'] = 0 if record['attributes']['sla']['breached'] is False else 1
            record_data['sla_status'] = record['attributes']['sla']['status']
            if record_data['sla_breach'] == 1:
                try:
                    record_data['breached_sla'] = record['attributes']['sla']['breach']['metric']
                    record_data['breached_at'] = record['attributes']['sla']['breach']['at']
                except KeyError:
                    for m in record['attributes']['sla']['metrics']:
                        try:
                            if record['attributes']['sla']['metrics'][m]['breachAt'] == record['attributes']['sla']['summary']['firstBreachAt']:
                                record_data['breached_sla'] = m
                                record_data['breached_at'] = str(datetime.strptime(record['attributes']['sla']['summary']['firstBreachAt'], '%Y-%m-%dT%H:%M:%S.%fZ'))[:-7]
                        except KeyError:
                            pass
        except KeyError:
            record_data['sla_breach'] = 0
        print(record_data)
        self.db.insert_update(KustomerConversations(**record_data))

    except KeyError:
        pass

您可以使用函数,它从嵌套的dict中为您提供元素,并且如果它不存在,则不会引发异常

像这样的速写:

def get_nested_dict_value(src_dict, *nested_keys, **kwargs):
    """
    Get value of some nested dict by series of keys with default value.
    Example:
    instead of: 
        x = data['a']['b']['c']['d']
    use
        x = get_nested_dict_value(data, 'a', 'b', 'c', 'd')
    or, if you need some non-None default value, add default=xxx kwarg:
        x = get_nested_dict_value(data, 'a', 'b', 'c', 'd', default=0)
    """
    default = kwargs.get("default", None)
    pointer = src_dict
    i = 0
    for key in nested_keys:
        i += 1
        if key in pointer:
            pointer = pointer[key]
            if i == len(nested_keys):
                return pointer
        else:
            return default
因此,不是:

try:    
    record_data['conversation_type'] = record['attributes']['custom']['typeStr']
except Exception:
    pass
您只需键入:

record_data['conversation_type'] = get_nested_dict_value(record, 'attributes', 'custom', 'typeStr')

首先,在可能的情况下,您应该尝试使用指定了默认值的
dict.get
。接下来,您可以考虑使代码变得更干净。考虑这一点:

try:
    record_data['status'] = record['attributes']['status']
except KeyError:
    pass
try:
    record_data['conversation_type'] = record['attributes']['custom']['typeStr']
except KeyError:
    pass
try:
    record_data['waiting_for'] = record['attributes']['custom']['typeStr']
except KeyError:
    pass
try:
    record_data['first_marked_done'] = record['attributes']['firstDone']['createdAt']
except KeyError:
    pass
现在重写,您可以确保一致的错误处理,而无需重复逻辑:

from contextlib import contextmanager

@contextmanager
def error_handling():
    try:
        yield
    except KeyError:
        pass

with error_handling():
    record_data['status'] = record['attributes']['status']
with error_handling():
    record_data['conversation_type'] = record['attributes']['custom']['typeStr']
with error_handling():
    record_data['waiting_for'] = record['attributes']['custom']['typeStr']
with error_handling():
    record_data['first_marked_done'] = record['attributes']['firstDone']['createdAt']

您可以为希望应用的各种规则定义任意数量的函数,如
错误处理

输入端和输出端的不同命名约定使得显式赋值的清晰度难以匹敌。保留版本的精确语义(例如,在没有
typeStr
的情况下,即使
categoryTree
可用,它也不会分配
conversation\u category
),这排除了某些选择(例如在每次访问时使用
try
/
循环数据结构);如果对输入数据进行更多假设,您可能会做得更好

尽管如此,除了
dict.get
,您还可以使用内置(
any
、和
dict
)并引入辅助函数和一些临时变量,以使代码更具可读性:

# this gives one digit of the hour for me...?
def ptime(s): return str(datetime.strptime(s,'%Y-%m-%dT%H:%M:%S.%fZ'))[:-7]

try:
    attr=record['attributes']
    cust=attr.get('custom',{})  # defer KeyErrors into the below
    record_data = dict(
        id = record['id'],
        created_at = ptime(attr['createdAt']),
        last_activity_at = ptime(attr['lastActivityAt']),
        first_marked_done = None,
        last_marked_done = None,
        assigned_team = attr['assignedTeams'][0] or None,
        conversation_type = None,
        conversation_category = None,
        conversation_subcategory = None,
        message_count = attr['messageCount'],
        note_count = attr['noteCount'],
        satisfaction = attr['satisfaction'],
        status = attr.get('status'),
        email = int(any(x == 'email' for x in attr['channels'])),
        chat = int(any(x == 'chat' for x in attr['channels'])),
        priority = attr['priority'],
        direction = 'outbound' if attr['direction'] == 'out' else 'in',
        nlp_score = None,
        nlp_sentiment = None,
        waiting_for = cust.get('typeStr'),
        sla_breach = 0,
        sla_status = None,
        breached_sla = None,
        breached_at = None
    )
    try:
        record_data['conversation_type'] = cust['typeStr']
        cat=str(cust['categoryTree']).split('.')
        record_data['conversation_category'] = cat[0]
        record_data['conversation_subcategory'] = cat[1] if len(cat) > 1 else None
    except KeyError: pass
    try:
        record_data['first_marked_done'] = ptime(attr['firstDone']['createdAt'])
        record_data['last_marked_done'] = ptime(attr['lastDone']['createdAt'])
    except KeyError: pass
    try:
        sla=attr['sla']
        record_data['sla_breach'] = 0 if sla['breached'] is False else 1
        record_data['sla_status'] = sla['status']
        if record_data['sla_breach'] == 1:
            try:
                record_data['breached_sla'] = sla['breach']['metric']
                record_data['breached_at'] = sla['breach']['at']
            except KeyError:
                for m,v in sla['metrics'].items():
                    try:
                        v=v['breachAt']
                        if v == sla['summary']['firstBreachAt']:
                            record_data['breached_sla'] = m
                            record_data['breached_at'] = ptime(v)
                    except KeyError: pass
    except KeyError: pass
    print(record_data)
    self.db.insert_update(KustomerConversations(**record_data))

except KeyError: pass

虽然您可能有一个反对它的策略,但在这种情况下,我建议在每一行上编写剩余的
,除了KeyError:pass
子句:这有助于对暂定代码进行可视化括起来。

dict.get(key,default=None)哦,伙计,我无法理解所有这些的逻辑:)我认为@SebastianLoehner的建议可能是正确的:尝试使用
.get()
访问密钥,因为它不会抛出
KeyError
,而是简单地返回
None
。也就是说,嵌套键并不容易。响应结构真的如此不可预测吗?