All Versions: 71
Latest Version:
Avg Release Cycle: 6 days
Latest Release: 1226 days ago

Changelog History

  • v0.0.91

    August 09, 2020
  • v0.0.90

    August 09, 2020
  • v0.0.89

    August 09, 2020
  • v0.0.88 Changes

    August 02, 2020
    1. Long to Timestamp conversion UDF
    2. Deequ version dependency fix
    3. Data Quality - Added new data validations to Metorikku based on Amazon Deequ: hasSize, hasUniqueness
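The hasSize and hasUniqueness validations named above can be illustrated with a minimal plain-Python sketch. This is not Metorikku's or Deequ's API: the function names `has_size`, `uniqueness`, and `has_uniqueness`, the list-of-dicts row format, and the sample data are all assumptions made for illustration. Uniqueness is taken here to mean the fraction of rows whose value combination occurs exactly once.

```python
from collections import Counter
from typing import Callable, Dict, List, Sequence


def has_size(rows: Sequence[Dict], assertion: Callable[[int], bool]) -> bool:
    """Pass when the row count satisfies the caller's assertion."""
    return assertion(len(rows))


def uniqueness(rows: Sequence[Dict], columns: List[str]) -> float:
    """Fraction of rows whose value combination in `columns` occurs exactly once."""
    if not rows:
        # Illustrative choice for this sketch: an empty table has no duplicates.
        return 1.0
    counts = Counter(tuple(row[c] for c in columns) for row in rows)
    rows_seen_once = sum(1 for n in counts.values() if n == 1)
    return rows_seen_once / len(rows)


def has_uniqueness(rows: Sequence[Dict], columns: List[str],
                   assertion: Callable[[float], bool]) -> bool:
    """Pass when the uniqueness fraction satisfies the caller's assertion."""
    return assertion(uniqueness(rows, columns))


# Hypothetical sample data: `id` is unique, `country` is not.
rows = [
    {"id": 1, "country": "US"},
    {"id": 2, "country": "US"},
    {"id": 3, "country": "FR"},
]

size_ok = has_size(rows, lambda n: n == 3)
id_ok = has_uniqueness(rows, ["id"], lambda f: f == 1.0)
country_ok = has_uniqueness(rows, ["country"], lambda f: f >= 0.5)
```

Passing the assertion as a function keeps the threshold in the caller's hands, so the same check can express an exact row count or a minimum uniqueness ratio.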
  • v0.0.87 Changes

    July 28, 2020

    New

    1. Data Quality - Added data validations to Metorikku based on Amazon Deequ (IsUnique, IsComplete)
    2. LoadIfExist: UDF that attempts to load a table and otherwise returns an empty DataFrame with a specified schema.
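The load-or-fall-back behaviour described for LoadIfExist can be sketched in plain Python. This is only an illustration of the idea, not the UDF's implementation: the `Table` class, the dictionary catalog, and the `load_if_exists` signature are hypothetical stand-ins for a Spark catalog and DataFrame.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class Table:
    """Hypothetical stand-in for a DataFrame: column names plus rows."""
    columns: List[str]
    rows: List[Tuple] = field(default_factory=list)


def load_if_exists(catalog: Dict[str, Table], name: str,
                   fallback_columns: List[str]) -> Table:
    """Return the named table, or an empty table with the given columns."""
    try:
        return catalog[name]
    except KeyError:
        # Table is missing: hand back an empty result with the expected schema
        # so downstream steps can still run.
        return Table(columns=list(fallback_columns))


catalog = {"events": Table(columns=["id"], rows=[(1,), (2,)])}
existing = load_if_exists(catalog, "events", ["id"])
missing = load_if_exists(catalog, "users", ["id", "name"])
```

Returning an empty table with a known schema, rather than raising, lets later steps that select or join on those columns proceed on a first run.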
  • v0.0.86

    July 01, 2020
  • v0.0.85

    July 01, 2020
  • v0.0.84 Changes

    May 25, 2020

    New

    1. Added a new mode called periodic, which runs a batch job in a loop
    2. Selective merge UDF
    3. Added lag reporting to batch and streaming jobs
    4. Added empty output protection
    5. Support reading metrics from a remote location (S3, HDFS)
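The periodic mode and empty output protection listed above can be pictured together with a rough sketch. It assumes a job is a callable returning a list of rows and that protection means skipping the write when that list is empty. The names `run_periodic`, `write`, `interval_seconds`, and `max_runs` are hypothetical, and `max_runs` exists only so the example terminates.

```python
import time
from typing import Callable, Dict, List


def run_periodic(job: Callable[[], List[Dict]],
                 write: Callable[[List[Dict]], None],
                 interval_seconds: float,
                 max_runs: int,
                 sleep: Callable[[float], None] = time.sleep) -> int:
    """Run `job` repeatedly, writing only non-empty outputs.

    Returns the number of runs whose output was written.
    """
    written = 0
    for run in range(max_runs):
        output = job()
        if output:
            write(output)
            written += 1
        # An empty output is skipped so it cannot overwrite existing data.
        if run < max_runs - 1:
            sleep(interval_seconds)
    return written


# Hypothetical usage: three runs, the second one produces nothing.
results = iter([[{"id": 1}], [], [{"id": 2}]])
sink: List[List[Dict]] = []
writes = run_periodic(lambda: next(results), sink.append,
                      interval_seconds=0.0, max_runs=3,
                      sleep=lambda seconds: None)
```

Injecting `sleep` keeps the loop testable without waiting for real time to pass.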

    Improvements

    1. Hudi: removeNullColumns flag to remove null columns before writing to Hudi
    2. Hudi: deletePendingCompactions flag removes pending compactions when running in streaming
    3. Hudi: added a manual Hive sync mode (this helps Hive 1 users)
    4. Hudi: added hudiTableName parameter
    5. Hive: support Hive 2.3
    6. Started releasing assembled JAR to Maven Central

    Fixes

    1. Hudi: fix metrics reporting when using Hudi in streaming
    2. Hudi: upgrade to 0.5.2 (breaking change)
    3. Refactored build process
  • v0.0.83

    May 19, 2020
  • v0.0.82

    May 19, 2020