Releases · RumbleDB/python-jsoniq

27 Aug 14:02

ghislainfourny

2.0.0a1

d94b4b7

jsoniq 2.0.0 alpha 1 Pre-release

When returning a single-column DataFrame with atomic values, the column is now named __value instead of value, to avoid collisions with user-defined columns.
Improved schema inference: DataFrames can be returned in a wider range of cases.
Improved error display in notebooks for errors that occur when the results are collected rather than when jsoniq() is called.
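
The reason for the column rename can be illustrated with plain Python dicts standing in for DataFrame rows. This is only a sketch: the helper unwrap_row is hypothetical and not part of the library; only the column name __value comes from the note above.

```python
# Hypothetical illustration: dicts stand in for DataFrame rows.
ATOMIC_COLUMN = "__value"  # column name taken from the release note


def unwrap_row(row, atomic_column=ATOMIC_COLUMN):
    """Return the bare atomic value if the row only wraps one, else the row."""
    if set(row) == {atomic_column}:
        return row[atomic_column]
    return row


wrapped_atomic = {"__value": 42}  # an atomic value wrapped in a single column
user_object = {"value": 42}       # a user-defined column that happens to be called "value"

print(unwrap_row(wrapped_atomic))
print(unwrap_row(user_object))
# With the old column name, the user's row would be mistaken for a wrapped atomic:
print(unwrap_row(user_object, atomic_column="value"))
```

With the reserved-looking name, a user row with a column called value is left untouched, whereas under the old name the two cases cannot be told apart.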

21 Aug 12:18

ghislainfourny

0.2.0a9

955145c

jsoniq 0.2.0 alpha 9 Pre-release

Stability improvements.

21 Aug 12:04

ghislainfourny

0.2.0a8

c61f5dc

jsoniq 0.2.0 alpha 8 Pre-release

Variables can now be bound to JSON values, pandas DataFrames or pyspark DataFrames with extra parameters to the rumble.jsoniq() call. It is no longer necessary to explicitly call bind(). This is similar to how DataFrames can be attached to views with extra parameters to spark.sql().
rumble.lastResult is now also correctly assigned when only partial data is returned (it then holds only the partial data).
Fixed issue with empty array constructors.
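
The binding style described in the first item above might look roughly as follows. This is a sketch under assumptions: it assumes the extra parameters are keyword arguments named after the JSONiq variable (by analogy with spark.sql()), and it uses a hypothetical RecordingSession stand-in so that it runs without Spark. The helper run_with_bindings is also hypothetical.

```python
def run_with_bindings(rumble, query, **variables):
    """Run a query, passing each keyword argument as a variable binding.

    ASSUMPTION: jsoniq() accepts the bindings as keyword arguments whose
    names match the JSONiq variable names (without the leading $).
    """
    return rumble.jsoniq(query, **variables)


class RecordingSession:
    """Hypothetical stand-in for a RumbleSession; it only records the call."""

    def jsoniq(self, query, **variables):
        self.last_call = (query, variables)
        return self


session = RecordingSession()
run_with_bindings(session, "for $v in $c return $v + 1", c=(1, 2, 3))
print(session.last_call)
```

With a real session, the same call shape would replace an explicit bind() followed by jsoniq().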

20 Aug 12:37

ghislainfourny

0.2.0a7

05d0ee6

jsoniq 0.2.0 alpha 7 Pre-release

rumble.lastResult now returns a pyspark or pandas DataFrame, an RDD, or a tuple, and no longer the sequence object.
Enhanced schema detection: when the detected static type of the overall query is DataFrame-compatible, the output can now be obtained as a DataFrame automatically, without explicitly giving a schema.
It is now possible to access a table previously registered as a view via a table() function call. This is an alternative to binding variables.
Enhanced the JSONiq Update Facility support for updating Delta files and Hive metastore tables.
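
Since a DataFrame is only available when the query's type allows it, calling code may want a fallback. The sketch below uses only the methods named in these notes (availableOutputs(), df(), json()); the helper best_output and the FakeSequence stand-in are hypothetical and exist only so that the example runs without Spark.

```python
def best_output(seq):
    """Return a DataFrame when one is available, otherwise the items as a tuple."""
    if "DataFrame" in seq.availableOutputs():
        return seq.df()
    return seq.json()


class FakeSequence:
    """Hypothetical stand-in for the object returned by jsoniq()."""

    def __init__(self, outputs, items):
        self._outputs = outputs
        self._items = items

    def availableOutputs(self):
        return self._outputs

    def df(self):
        # A real result would be a pyspark DataFrame; a tagged tuple is used here.
        return ("dataframe", self._items)

    def json(self):
        return tuple(self._items)


print(best_output(FakeSequence(["DataFrame"], [1, 2])))
print(best_output(FakeSequence([], [1, 2])))  # any list lacking "DataFrame"
```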

13 Aug 14:04

ghislainfourny

0.2.0a6

8cfc3a7

jsoniq 0.2.0 alpha 6 Pre-release

Fixed a bug with the config() call of the builder.
Added withDelta() to configure Delta Lake tables and files, for use with the JSONiq Update Facility.
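
A session with Delta Lake support would presumably be created by chaining withDelta() on the builder. The sketch below assumes that withDelta() takes no arguments and returns the builder; the helper create_delta_session and the FakeBuilder stand-in are hypothetical and only make the example runnable without Spark.

```python
def create_delta_session(builder):
    """Create a session with Delta Lake support enabled.

    ASSUMPTION: withDelta() returns the builder, so the calls can be chained.
    """
    return builder.withDelta().getOrCreate()


class FakeBuilder:
    """Hypothetical stand-in for RumbleSession.builder."""

    def __init__(self):
        self.delta = False

    def withDelta(self):
        self.delta = True
        return self

    def getOrCreate(self):
        # A real builder returns a session; a dict records the setting here.
        return {"delta": self.delta}


print(create_delta_session(FakeBuilder()))
```

With the real library, the argument would be RumbleSession.builder.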

08 Aug 08:42

ghislainfourny

0.2.0a5

0b137cf

jsoniq 0.2.0 alpha 5 Pre-release

If the initialization of the Spark session fails, we now check whether SPARK_HOME is set and whether it may be invalid or point to a Spark version other than 4.0, and output a more informative error message.
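
A check of this kind can be sketched with the standard library alone. This is not the library's actual code: the helper diagnose_spark_home and its messages are hypothetical, and the sketch only covers the "set" and "valid directory" checks, not version detection.

```python
import os


def diagnose_spark_home(environ=None):
    """Return a human-readable hint about SPARK_HOME (illustrative only)."""
    environ = os.environ if environ is None else environ
    spark_home = environ.get("SPARK_HOME")
    if not spark_home:
        return "SPARK_HOME is not set."
    if not os.path.isdir(spark_home):
        return f"SPARK_HOME points to a missing directory: {spark_home}"
    return f"SPARK_HOME is set to {spark_home}; check that it is a Spark 4.0 installation."


# Inspect the current environment.
print(diagnose_spark_home())
```

Passing a dict instead of relying on os.environ makes the helper easy to try out with different settings.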

06 Aug 14:47

ghislainfourny

0.2.0a4

2fea77c

jsoniq 0.2.0 alpha 4 Pre-release

Added parameters to the jsoniq magic to select the desired output to print: -j, -df, -pdf.
Added an informative error message, with a hint on how to fix it, when trying to get a DataFrame and there is no schema.
Added parameter -t to the jsoniq magic to measure the response time.
The RumbleSession object now saves the latest result (sequence of items) in a field called lastResult. This is particularly useful in notebooks for post-processing a result in Python after obtaining it through the jsoniq magic.
Improved static type detection when binding a pandas or pyspark DataFrame as an input variable to a JSONiq query.
Now also accepts pandas version 2.2.

04 Aug 12:35

ghislainfourny

0.2.0a2

0225d19

0.2.0a2: Merge pull request #9 from RumbleDB/JSONiqMagic Pre-release

You can change the result size cap through the now-accessible Rumble configuration (for example rumble.getRumbleConf().setResultSizeCap(10)). This controls the maximum number of items that can be retrieved with a json() call. If you reach the cap, you can increase it to any number you like.
Added the JSONiq magic to execute JSONiq queries directly in a notebook cell, using the RumbleDB instance shipped with the library.
RumbleSession.builder.getOrCreate() now correctly reuses an existing session instead of creating a new object. It preserves the configuration.
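
The configuration call quoted in the first item can be wrapped in a small helper. The sketch uses only getRumbleConf() and setResultSizeCap() from the note above; the helper with_result_cap and the two stand-in classes are hypothetical and only make the example runnable without Spark.

```python
def with_result_cap(rumble, cap):
    """Set the maximum number of items that json() may return, then return the session."""
    rumble.getRumbleConf().setResultSizeCap(cap)
    return rumble


class FakeConf:
    """Hypothetical stand-in for the Rumble configuration object."""

    def __init__(self):
        self.cap = None

    def setResultSizeCap(self, cap):
        self.cap = cap


class FakeSession:
    """Hypothetical stand-in for a RumbleSession."""

    def __init__(self):
        self._conf = FakeConf()

    def getRumbleConf(self):
        return self._conf


session = with_result_cap(FakeSession(), 10)
print(session.getRumbleConf().cap)
```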

22 Jul 08:55

ghislainfourny

0.2.0a1

5ddcc1b

jsoniq 0.2.0 alpha 1 Pre-release

JSONiq variables can now be bound to pandas DataFrames.
The output of a JSONiq query can now be retrieved as a pandas DataFrame (if the output is available as a DataFrame, i.e., availableOutputs() returns a list that contains "DataFrame").
Cleaned up the mapping so that tuples strictly map to sequences of items and lists to array items. This avoids confusion between arrays and sequences.
As a consequence, json() now returns a tuple, not a list.
Calling bind() with a single list now returns an informative error. Use bind() with a tuple instead, or call bindOne() to interpret the list as a sequence of one array item.
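
The tuple and list rules above can be modelled in a few lines of plain Python. This is a model of the described behaviour, not the library's code: classify and check_bind_argument are hypothetical names, and the exception type and message are assumptions.

```python
def classify(value):
    """Describe how a Python value is interpreted under the rules above."""
    if isinstance(value, tuple):
        return f"sequence of {len(value)} item(s)"
    if isinstance(value, list):
        return "one array item"
    return "not covered by this sketch"


def check_bind_argument(value):
    """Mimic the bind() rule: a bare list is rejected with a helpful message."""
    if isinstance(value, list):
        # ASSUMPTION: the real error type and wording may differ.
        raise TypeError(
            "bind() got a list; pass a tuple for a sequence, "
            "or use bindOne() to bind a single array item"
        )
    return value


print(classify((1, 2, 3)))     # a tuple is a sequence
print(classify([1, 2, 3]))     # a list is a single array
print(classify(([1, 2, 3],)))  # a tuple holding one list is a sequence of one array
```

Keeping the two Python types apart in this way is what allows json() to return a tuple whose members may themselves be lists, without ambiguity.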

21 Jul 11:25

ghislainfourny

0.1.0a12

a4bb9df

jsoniq 0.1.0 alpha 12 Pre-release

JSONiq variables can now be bound to Python values (mapping Python lists to sequences of items). This makes it possible to manipulate Python values directly with JSONiq, even without any knowledge of Spark.
Renamed bindDataFrameAsVariable() to bind(), which can be used with both DataFrames and Python lists.
Added bindOne() for binding a single value to a JSONiq variable.
df() now wraps its result in a pyspark DataFrame so that it can be used with pyspark DataFrame transformations.
