Release Notes v2024.02.02
stress-tess
released this
02 Feb 22:33
·
410 commits
to master
since this release
Bug Fixes
- Issues #2647, #2650, #2661, #2666 - Fix bugs in
filter
,remove_repeats
,append_single
andget_jth
for segarrays with empty segments - Issue #2937 - Fixes string read bug for large Parquet files
Major Updates
- Issue #2853 - Adds
df.merge
on a mix of String and integer columns - Issues #2862, #2863, #2871 - Implement
histogram2d
andhistogramdd
- Issue #2927 - Adds power divergence statistic,
chisquare
, andxlogy
- Issues #2905, #2912, #2914, #2918 - Add
isalpha
,isalnum
,isdigit
, andisempty
for Strings - Issues #2888, #2878 - Add float support to
in1d
and groupby
Minor Updates
- Issues #2873, #2886 - Enable short strings optimization for multi-column groupby
- Issue #2894 - Adds
dropna
option to dataframe groupby - Issue #2461 - Adds
where
argument to trig functions - Issue #2430 - Adds shift equals for pdarray
- Issue #2831 - Aligns
dataframe.groupby().size()
anddataframe.groupby().sum()
with pandas - PRs #2829, #2865, #2876 - Add infrastructure and partial implementation of array API
- PR #2870 - Expands DataFrame initializer
- PR #2931 - Aligns
Series
indexing methods with pandas - Issue #2909 - Renames
is_upper/lower/title
andto_upper/lower/title
to match numpy
Auto-Generated Release Notes
- Scaffolding for multi-dimensional arrays by @jeremiah-corrado in #2829
- Closes #2836: Remove
computeOnSegmentsWithoutAggregation
by @pierce314159 in #2854 - Closes #2857: Move init methods to compat modules for 132 by @bmcdonald3 in #2858
- Closes #2855: Read batches for Parquet strings by @bmcdonald3 in #2856
- Closes #2853:
df.merge
on a mix of String and integer columns by @pierce314159 in #2859 - Closes #2862: Implement
histogram2d
by @pierce314159 in #2864 - Closes #2866: Fix empty domain creation for block arrays by @bmcdonald3 in #2867
- Close #2871: Fix numeric precision issue in
testHistogram
by @pierce314159 in #2872 - DataFrame initializer improvements by @brandon-neth in #2870
- Fixes #2666:
Segarray.filter
bug with empty segments by @pierce314159 in #2860 - Fixes #2661:
segarray.remove_repeats
bug with empty segments by @pierce314159 in #2861 - Closes #2863: Implement
histogramdd
by @pierce314159 in #2868 - Array creation perf patch by @jeremiah-corrado in #2869
- Closes #2875: Clean up deprecations in advance of 1.33 release by @bmcdonald3 in #2874
- Closes #2873: Short strings optimization for multicol groupby by @pierce314159 in #2877
- Only use
tryCreateArray
interface to create 1D arrays by @jeremiah-corrado in #2879 - Fix typo in #2879 by @jeremiah-corrado in #2882
- Closes #2878: Add groupby for floats by @pierce314159 in #2884
- Fixes #2883: Fix ambiguity in
makeDistArray
where clauses by @jeremiah-corrado in #2885 - Closes #2886: Cap small strs groupby optimization by @pierce314159 in #2887
- Closes #2888: Add in1d float support by @pierce314159 in #2889
- Fixes #2650: Fixes
segarray.append_single
with empty segments by @pierce314159 in #2891 - Closes #2461: Implementation of
where
feature for trig functions by @jaketrookman in #2851 - Fixes #2647: Segarray.get_jth bug with bool values by @pierce314159 in #2895
- Array API infrastructure and partial implementation by @jeremiah-corrado in #2865
- Closes ticket #2831 to align dataframe.groupby().size(), dataframe.groupby().sum() to pandas by @ajpotts in #2892
- Adjustments to remove some chpl unstable warnings by @jeremiah-corrado in #2890
- Fix CommAggregation build error on 1.32 by @jeremiah-corrado in #2900
- Closes #2901: Fix performance regression due to setting ndim field in sym entry by @bmcdonald3 in #2902
- Replace use of binary 'or' with logical 'or' by @DanilaFe in #2907
- Closes #2894 add dropna option to dataframe groupby by @ajpotts in #2904
- closes #2430 add shift equals for pdarray by @ajpotts in #2908
- Closes #2905 add strings isalnum by @ajpotts in #2910
- Closes ticket #2909 rename is_upper and similar ops to match numpy by @ajpotts in #2920
- Closes #2922: Pin
pandas<2.2.0
to avoid CI failures by @pierce314159 in #2923 - Closes #2912 add isalpha for pdarray by @ajpotts in #2921
- Apply black formatter to array_api frontend code by @jeremiah-corrado in #2928
- Modify fix to performance regression in #2902 by @jeremiah-corrado in #2930
- Closes #2914-add-isdigit-for-pdarray by @ajpotts in #2925
- Make arkouda server configurable w.r.t. supported types by @jeremiah-corrado in #2876
- Indexing methods for
Series
by @brandon-neth in #2931 - Closes #2937 : Fix string read bug for large Parquet files by @bmcdonald3 in #2938
- Closes #2918 add isempty for pdarray by @ajpotts in #2933
- closes #2927 power divergence statistic by @ajpotts in #2932
New Contributors
Full Changelog: v2023.11.15...v2024.02.02