LINQ和从对象数组中删除重复项

LINQ和从对象数组中删除重复项,linq,distinct,Linq,Distinct,我正在尝试使用两列对对象数组进行重复数据消除,其中第二列是字典。描述这一点的最佳方法是显示一些代码: class MyClass { public int ID; public Dictionary<int, int> Dict = new Dictionary<int, int>(); } 请注意,其中一个对象已从原始列表中删除,因为它具有重复的ID和Dict字典值。我一直在使用以下替代版本: v

我正在尝试使用两列对对象数组进行重复数据消除,其中第二列是字典。描述这一点的最佳方法是显示一些代码:

    class MyClass
    {
        public int ID;
        public Dictionary<int, int> Dict = new Dictionary<int, int>();
    }
请注意,其中一个对象已从原始列表中删除,因为它具有重复的ID和Dict字典值。我一直在使用以下替代版本:

        var s = from p in list
                 group p by p.ID into group1
                 from group2 in
                     (from p in group1 group p by p.Dict)
                 group group2 by group1.Key;
但只是运气不好。感谢大家对解决这个问题的见解


PS-我不会改变规则,但我相信GROUPBY和SELECTFIRST会比DISTINCT更干净,它有额外的比较器代码。任何人都可以通过GROUP BY解决这个问题。

对于引用类型,您应该添加相等比较器,以便执行您想要的操作。添加以下类:

public class MyClassComparer : IEqualityComparer<MyClass>
    {
        public bool Equals(MyClass left, MyClass right)
        {
            if (left == null && right == null)
            {
                return true;
            }
            if (left == null || right == null)
            {
                return false;
            }

            if (left.ID == right.ID)
            {
                if (left.Dict == null && right.Dict == null)
                {
                    return true;
                }
                if (left.Dict == null || right.Dict == null)
                {
                    return false;
                }
                if (left.Dict.Count != right.Dict.Count)
                {
                    return false;
                }

                foreach(var key in left.Dict.Keys)
                {
                    if(!right.Dict.ContainsKey(key))
                        return false;

                    if (left.Dict[key] != right.Dict[key])
                     return false;
                }

                return true;
            }
            else return false;
        }

        public int GetHashCode(MyClass author)
        {
            return (author.ID).GetHashCode();
        }
    }

您应该改进GetHashCode方法。这将是你的家庭作业:

我能为以下几点拍拍吗

var filteredList = list.GroupBy(mc => mc.ID)
                       .SelectMany(gr => gr.Distinct(new MyClassComparer()))
                       .ToList();
比较器:

public class MyClassComparer : IEqualityComparer<MyClass>
{
    public bool Equals(MyClass a, MyClass b)
    {
        return a.Dict.Count == b.Dict.Count && !a.Dict.Except(b.Dict).Any();
    }


    public int GetHashCode(MyClass a)
    {
        return a.ID;
    }
}
List<MyClass> list = new List<MyClass>();
MyClass mc1 = new MyClass();
list.Add(mc1); mc1.ID = 1; mc1.Dict.Add(1, 1);

MyClass mc2 = new MyClass();
list.Add(mc2); mc2.ID = 1; mc2.Dict.Add(1, 1);

MyClass mc3 = new MyClass();
list.Add(mc3); mc3.ID = 1; mc3.Dict.Add(1, 2);

MyClass mc4 = new MyClass();
list.Add(mc4); mc4.ID = 2; mc4.Dict.Add(1, 1);


var result = list.Distinct(new MyClassComparer()).ToList();
var filteredList = list.GroupBy(mc => mc.ID)
                       .SelectMany(gr => gr.Distinct(new MyClassComparer()))
                       .ToList();
public class MyClassComparer : IEqualityComparer<MyClass>
{
    public bool Equals(MyClass a, MyClass b)
    {
        return a.Dict.Count == b.Dict.Count && !a.Dict.Except(b.Dict).Any();
    }


    public int GetHashCode(MyClass a)
    {
        return a.ID;
    }
}