Hi,
In apache beam step I have a collection of KV<String, Iterable<KV<Long, GenericRecord>>>> .
I want to write all the records in the iterable to the same parquet file. My code snippet is given below
p.apply(ParDo.of(new MapWithAvroSchemaAndConvertToGenericRecord())) // PCollection<GenericRecord>
.apply(ParDo.of(new MapKafkaGenericRecordValue(formatter, options.getFileNameDelimiter()))) //PCollection<KV<String, KV<Long, GenericRecord>>>
.apply(GroupByKey.create()) //PCollection<KV<String, Iterable<KV<Long, GenericRecord>>>>>
now I want to write all the Records in the Iterable in the same parquet file(derive the file name by the key of KV).