PXF Hive/HDFS parquet - unwanted timestamp conversion

217 views
Skip to first unread message

ales....@gmail.com

unread,
May 3, 2022, 4:59:56 AM5/3/22
to Greenplum Users
Hello,
I have a parquet file on HDFS with timestamp column and Hive table.
If I query data via PXF Hive or PXF HDFS I get a different value for timestamp columns comparing to the direct hive query (some timezone shift).

PXF JVM is set to operate in UTC zone via -Duser.timezone parameter in pxf-env.sh. I can't change this setting to make sure other JDBC connections works correctly.

Question is why this unwanted conversion for PXF Hive/HDFS happens and if there is any way how to avoid it?

Thank you

ales....@gmail.com

unread,
May 3, 2022, 5:43:19 AM5/3/22
to Greenplum Users, ales....@gmail.com
Just a note: also tried hive.parquet.timestamp.skip.conversation  property  w/o any change in behavior.
Message has been deleted
Message has been deleted
Message has been deleted
Message has been deleted

Ashuka Xue

unread,
May 3, 2022, 2:59:32 PM5/3/22
to Alexander Denissov, Bradford Boyle, Venkatesh Raghavan, Himanshu Pandey, Bhuvnesh Chaudhary, Greenplum Users
Hi,

Could you raise an issue on Github (https://github.com/greenplum-db/pxf) with the following information:

- Greenplum Version
- PXF version
- Hadoop and Hive versions
- Greenplum table DDL
- Parquet schema and sample parquet data if possible
- Configuration files such as pxf-env.sh, hive-site.xml, hdfs-site.xml, etc.

Thanks,
Ashuka

From: Alexander Denissov <aden...@vmware.com>
Sent: Tuesday, May 3, 2022 10:15 AM
To: Bradford Boyle <brad...@vmware.com>; Ashuka Xue <ax...@vmware.com>; Venkatesh Raghavan <ragha...@vmware.com>; Himanshu Pandey <pand...@vmware.com>; Bhuvnesh Chaudhary <bchau...@vmware.com>
Subject: FW: [gpdb-users] Re: PXF Hive/HDFS parquet - unwanted timestamp conversion
 

 

 

From: ales....@gmail.com <ales....@gmail.com>
Date: Tuesday, May 3, 2022 at 2:43 AM
To: Greenplum Users <gpdb-...@greenplum.org>
Cc: pvtl-cont-ales.kotuc <ales....@gmail.com>
Subject: [gpdb-users] Re: PXF Hive/HDFS parquet - unwanted timestamp conversion

External Email

--
You received this message because you are subscribed to the Google Groups "Greenplum Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to gpdb-users+...@greenplum.org.
To view this discussion on the web visit https://groups.google.com/a/greenplum.org/d/msgid/gpdb-users/b9c24911-8357-47cf-9aa6-7b9757be2fa2n%40greenplum.org.

 


External Email: This email originated from outside of the organization. Do not click links or open attachments unless you recognize the sender.

ales....@gmail.com

unread,
May 5, 2022, 9:44:26 AM5/5/22
to Greenplum Users, ax...@vmware.com, Alexander Denissov, Bradford Boyle, Venkatesh Raghavan, Himanshu Pandey, Bhuvnesh Chaudhary
Reply all
Reply to author
Forward
0 new messages