Java Spark streaming Dataset does not serialize data correctly
Tags: java, apache-spark, apache-spark-sql, spark-streaming

I have the following code in Spark Structured Streaming:
// Map each incoming Data record to a Person with a nested Details bean.
Dataset<Person> personDf = dataDf.map(
    (MapFunction<Data, Person>) data -> {
        Person person = new Person();
        person.setName(data.getName());
        Details details = new Details();
        details.setAge(data.getAge());
        details.setGender(data.getGender());
        person.setDetails(details);
        return person;
    }, Encoders.bean(Person.class));
personDf
    .writeStream()
    .format("parquet")
    .start("/home/hadoop/test/");
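The problem with the code above is that when I write personDf, I can only see a single column named name; the details column is missing from the Parquet output. What am I missing here? Can anyone help?

Comment: Did you find a solution? I am facing a similar problem. It would be very helpful if you could share how you solved it.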
Person Object
import java.io.Serializable;

public class Person implements Serializable {
    private String name;
    private Details details;
    // Getters & setters
    // hashCode and equals
}
Details Object
import java.io.Serializable;

public class Details implements Serializable {
    private String age;
    private String gender;
    // Getters & setters
    // hashCode and equals
}
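
One thing that may be worth checking: as far as I understand, Encoders.bean derives the Dataset columns from JavaBean properties, so a field like details is probably only picked up if Person has a public getDetails()/setDetails() pair and Details itself is a public bean with its own getters and setters. The sketch below does not use Spark at all. It only uses the JDK's java.beans.Introspector to list the properties that have both a getter and a setter, which approximates what a bean encoder would see. BeanPropertyCheck and readWriteProperties are hypothetical names, and the nested Person and Details classes are stand-ins for the ones in the question with the accessors written out.

```java
import java.beans.IntrospectionException;
import java.beans.Introspector;
import java.beans.PropertyDescriptor;
import java.io.Serializable;
import java.util.ArrayList;
import java.util.List;

public class BeanPropertyCheck {

    // Stand-in for the Details class in the question (accessors written out).
    public static class Details implements Serializable {
        private String age;
        private String gender;
        public String getAge() { return age; }
        public void setAge(String age) { this.age = age; }
        public String getGender() { return gender; }
        public void setGender(String gender) { this.gender = gender; }
    }

    // Stand-in for the Person class in the question (accessors written out).
    public static class Person implements Serializable {
        private String name;
        private Details details;
        public String getName() { return name; }
        public void setName(String name) { this.name = name; }
        public Details getDetails() { return details; }
        public void setDetails(Details details) { this.details = details; }
    }

    // Lists the names of properties that expose both a public getter and a public setter.
    // Passing Object.class as the stop class leaves out the inherited "class" property.
    public static List<String> readWriteProperties(Class<?> beanClass) {
        List<String> names = new ArrayList<>();
        try {
            PropertyDescriptor[] descriptors =
                Introspector.getBeanInfo(beanClass, Object.class).getPropertyDescriptors();
            for (PropertyDescriptor pd : descriptors) {
                if (pd.getReadMethod() != null && pd.getWriteMethod() != null) {
                    names.add(pd.getName());
                }
            }
        } catch (IntrospectionException e) {
            throw new IllegalStateException("Could not introspect " + beanClass, e);
        }
        return names;
    }

    public static void main(String[] args) {
        System.out.println("Person:  " + readWriteProperties(Person.class));
        System.out.println("Details: " + readWriteProperties(Details.class));
    }
}
```

If details does not show up in the list for the real Person class, a missing or non-public getter or setter would be the first thing to look at. This is only a diagnostic sketch under the assumption above, not a confirmed fix for the missing column.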