Skip to content

Check Remove duplicates functionality from SparkIcebergUtils #5

Description

@Neuw84

spark .sql(""" INSERT OVERWRITE employee SELECT employee_id, start_date, first(team),first(role),first(address),first(name) FROM employee WHERE cast(start_date as date) = '2020-07-01' -- here we remove from a predefined day GROUP BY employee_id, start_date """ ) .show();

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Fields

    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions