Mock Delta table standalone for testing purpose using local file system, is it possible ?

1,018 views
Skip to first unread message

Rajesh Datla

unread,
Oct 6, 2022, 10:15:46 PM10/6/22
to Delta Lake Users and Developers
Hi All,

How to Mock Delta table standalone for testing purpose using local file system, is it possible ? without using S3 or blob storage ?

I checked below link but unable to find right solution to Mock Delta tables


Thank You
Raj




Matthew Powers

unread,
Oct 7, 2022, 9:00:17 AM10/7/22
to Rajesh Datla, Delta Lake Users and Developers
We're working on a Delta Acceptance Testing project (DAT) that contains Delta Lake reference tables.  You can clone the repo and use these tables for testing.


I'd be cool to open source some code snippets that show how to read the DAT reference tables with Delta Standalone.  A blog post on this would also be useful.  Ping me anytime if you'd ever like to collab on these ;)

--
You received this message because you are subscribed to the Google Groups "Delta Lake Users and Developers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to delta-users...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/delta-users/e5033144-30c7-45bc-9b25-909aeef9c1den%40googlegroups.com.

Tathagata Das

unread,
Oct 7, 2022, 10:34:03 AM10/7/22
to Matthew Powers, Rajesh Datla, Delta Lake Users and Developers
Hey Rajesh, 

What kind of testing do you want to do? Delta Standalone currently does not natively support writing the parquet files in a Delta table, it can only read already written Delta tables. So the current best approach is to create Delta tables with Delta-on-spark or delta-rust, and then read them with Delta Standalone. 

What Matthew mentioned, the DAT project, is a community attempt to bridge the gap across all the individual delta projets and make sure they are all cross-compatible in what tables they can read and writes (that is, they all follow the delta protocol correctly). Its very new, so you can play with it, but if you want to really created tables, then best to use what i said.

Hope this helps.


Aurangzeb Khan

unread,
Oct 8, 2022, 2:50:56 AM10/8/22
to Delta Lake Users and Developers
I think, it's just a matter of installing delta locally.
pip install delta 
use PySpark to create / read a table using a local path ./folder/

Reply all
Reply to author
Forward
0 new messages