Release Notes v2023.04.07
Ethan-DeBandi99
released this
07 Apr 13:30
·
661 commits
to master
since this release
Bug Fixes
- Issue #2329 - Fixes continued SegArray read performance issues
- Issue #2299 - Fixes file writes reporting success when directory does not exist
- Issue #2297 - Fixes HDF5 single file write stall
- Issue #2327 - Fixes issue loading 16-bit and 32-bit from Parquet
- Issue #2337 - Fixes
ak.DataFrame.to_parquet
with IPv4 columns - Issue #2348 - Fixes
IPv4
removal from DataFrame - Issue #2350 - Fixes DataFrame column subset access
- Issues #2306, #2317 and PR #2354 - Fix
Datetime
component scaling - Issue #2328 - Fixes bug when printing Dataframes containing bigint
- Issue #2307 - Fixes reported memory usage exceeding 100 percent
- Issue #2309 - Fixes error in Groupby.unique on Categorical or Strings
- Issue #2240 - Fixes
ak.coargsort
empty String and Categorical bug - Issue #2347 - Fixes broken links in
README.md
New Features
- Issue #2050 - Adds File of Origin when loading data from Parquet or HDF5. Use
ak.read_tagged_data
to return the file origin information. More information on this function can be found here. - Issue #2295 - Adds SegArray filters, ak.SegArray.filter() allows values to be removed from SegArray.
- Issue #2293- Adds
ak.where
support Strings and Categoricals - Issue #2324 - Adds benchmark documentation
- Issue #2209 - Enhances Arkouda metrics
Minor Updates
- Issue #2015 - Updates Parquet NaN detection on float/double columns to be more efficient
- Issue #2280 - Improves
max_bits
handling - Issues #2319, #2320, #2339 - Update to test framework to
pytest
configuration
Auto-Generated Release Notes
- Recommend Chapel 1.30.0 and use it for CI testing by @ronawho in #2291
- Closes #2299 - Fixes file writes reporting success when directory does not exist by @Ethan-DeBandi99 in #2305
- Closes #2297 - Fixes HDF5 Single File Write Stall by @Ethan-DeBandi99 in #2310
- Closes #2280: Improve
max_bits
handling by @pierce314159 in #2303 - Closes #2311: Reenable bulk transfer for developer builds by @bmcdonald3 in #2312
- Closes #2240:
ak.coargsort
empty String and Categorical bug fix by @jaketrookman in #2308 - Closes #2309: Error in Groupby.unique on Categorical or Strings by @pierce314159 in #2315
- Fixes #2306:
Datetime.date
scaling by @pierce314159 in #2316 - Closes #2023 - Added all compression types to legacy benchmarks by @joshmarshall1 in #2302
- Bump bigint groupby benchmark problem size by @ronawho in #2326
- Closes #2295 - SegArray Filters by @Ethan-DeBandi99 in #2325
- Closes #2329 - Fixes Continued SegArray Read Performance Issues by @Ethan-DeBandi99 in #2330
- Closes #2335 - Update Quickstart to point to v2023.03.24 by @Ethan-DeBandi99 in #2336
- Closes #2050 - File of Origin on Read by @Ethan-DeBandi99 in #2323
- Closes #2327 - Fixes Issue Loading 16-bit and 32-bit from Parquet by @Ethan-DeBandi99 in #2333
- Fixes #2328: Bug when printing Dataframes containing bigint by @pierce314159 in #2331
- Closes #2319 - PyTest Test Suite Restructure by @joshmarshall1 in #2314
- Closes #2320 - Adding Pytest structure to
array_view_test.py
by @joshmarshall1 in #2338 - Closes #2339 - Adding PyTest Structure to
bigint_agg_test.py
by @joshmarshall1 in #2340 - 2307 reported mem used greater than 100 percent by @hokiegeek2 in #2346
- Eliminate some communication from datetime methods by @ronawho in #2354
- Closes #2317: Add ArkoudaTimeShim modules to improve DateTimeMsg performance by @bmcdonald3 in #2353
- Closes #2015 - Efficient NaN read from Parquet by @Ethan-DeBandi99 in #2351
- Closes #2337 - Fixes
ak.DataFrame.to_parquet
with IPv4 Columns by @Ethan-DeBandi99 in #2352 - Closes #2348 - Fixes
IPv4
Removal from DataFrame by @Ethan-DeBandi99 in #2358 - Closes #2293:
ak.where
support Strings and Categoricals by @pierce314159 in #2349 - Closes #2362: Start using Chapel 1.30 for building gh-pages by @bmcdonald3 in #2363
- Closes #2347 - Fixes broken links in
README.md
by @Ethan-DeBandi99 in #2360 - Closes #2350 - Fixes DataFrame Column Subset by @Ethan-DeBandi99 in #2359
- Setting pages to v3 checkout by @Ethan-DeBandi99 in #2364
- Updated to deployment v4 by @Ethan-DeBandi99 in #2365
- Closes #2366: Update graph files after IO benchmark changes by @bmcdonald3 in #2367
- Closes #2324 - Benchmark documentation by @joshmarshall1 in #2361
- 2209 enhance Arkouda metrics by @hokiegeek2 in #2332
Full Changelog: v2023.03.24...v2023.04.07