Skip to content

Release Notes v2023.04.07

Compare
Choose a tag to compare
@Ethan-DeBandi99 Ethan-DeBandi99 released this 07 Apr 13:30
· 661 commits to master since this release
80c4fe6

Bug Fixes

  • Issue #2329 - Fixes continued SegArray read performance issues
  • Issue #2299 - Fixes file writes reporting success when directory does not exist
  • Issue #2297 - Fixes HDF5 single file write stall
  • Issue #2327 - Fixes issue loading 16-bit and 32-bit from Parquet
  • Issue #2337 - Fixes ak.DataFrame.to_parquet with IPv4 columns
  • Issue #2348 - Fixes IPv4 removal from DataFrame
  • Issue #2350 - Fixes DataFrame column subset access
  • Issues #2306, #2317 and PR #2354 - Fix Datetime component scaling
  • Issue #2328 - Fixes bug when printing Dataframes containing bigint
  • Issue #2307 - Fixes reported memory usage exceeding 100 percent
  • Issue #2309 - Fixes error in Groupby.unique on Categorical or Strings
  • Issue #2240 - Fixes ak.coargsort empty String and Categorical bug
  • Issue #2347 - Fixes broken links in README.md

New Features

  • Issue #2050 - Adds File of Origin when loading data from Parquet or HDF5. Use ak.read_tagged_data to return the file origin information. More information on this function can be found here.
  • Issue #2295 - Adds SegArray filters, ak.SegArray.filter() allows values to be removed from SegArray.
  • Issue #2293- Adds ak.where support Strings and Categoricals
  • Issue #2324 - Adds benchmark documentation
  • Issue #2209 - Enhances Arkouda metrics

Minor Updates

  • Issue #2015 - Updates Parquet NaN detection on float/double columns to be more efficient
  • Issue #2280 - Improves max_bits handling
  • Issues #2319, #2320, #2339 - Update to test framework to pytest configuration
Auto-Generated Release Notes

Full Changelog: v2023.03.24...v2023.04.07