-
Notifications
You must be signed in to change notification settings - Fork 3.5k
Insights: apache/arrow
September 14, 2024 – September 21, 2024
Overview
Could not load contribution data
Please try again later
30 Pull requests merged by 14 people
-
GH-43960: [R] fix
str_sub
binding to properly handle negativeend
values#44141 merged
Sep 21, 2024 -
MINOR: [Docs] Update implementation matrix for view types in arrow-rs
#44175 merged
Sep 20, 2024 -
GH-40493: [GLib][Ruby] Add GArrowStreamDecoder
#44170 merged
Sep 20, 2024 -
GH-39982: [Java] Add RunEndEncodedVector
#43888 merged
Sep 20, 2024 -
GH-44008: [C++][Parquet] Add support for arrow::ArrayStatistics: boolean
#44009 merged
Sep 19, 2024 -
GH-44155: [Archery][Integration] Rename "language" to "implementation"
#44156 merged
Sep 19, 2024 -
GH-43873: [Go][CI] Remove Go related test CI
#44143 merged
Sep 19, 2024 -
MINOR: [Docs] Fix number of minor format versions since 1.0.0
#44163 merged
Sep 18, 2024 -
MINOR: [C++][CI] Move ThreadSanitizer build to Ubuntu 24.04
#44159 merged
Sep 18, 2024 -
GH-43964: [Python] Build macOS and manylinux wheels for free-threading
#43965 merged
Sep 18, 2024 -
GH-44052: [C++][Compute] Reduce the complexity of row segmenter
#44053 merged
Sep 18, 2024 -
GH-44153: [GLib][FlightRPC] Fix closure annotation
#44154 merged
Sep 18, 2024 -
GH-43868: [CI][Python] Skip test that requires PARQUET_TEST_DATA env on emscripten
#43906 merged
Sep 18, 2024 -
GH-43874: [CI][Integration][Go] Use apache/arrow-go
#44142 merged
Sep 18, 2024 -
GH-43875: [Go][CI] Remove Go related lint configurations
#44144 merged
Sep 18, 2024 -
GH-37756: [Format][Docs] Document IPC Compression
#43950 merged
Sep 17, 2024 -
GH-43809: [Docs] Update extension type examples to not use UUID
#44120 merged
Sep 17, 2024 -
GH-44149: [Packaging][CI] Remove references to deprecated Ubuntu bionic
#44150 merged
Sep 17, 2024 -
MINOR: [Docs][Python] Fix return type in docstring for Array.slice
#44134 merged
Sep 17, 2024 -
GH-44098: [C++] Add home made _mm256_set_m128i for compilers who are missing it
#44116 merged
Sep 17, 2024 -
MINOR: [Dev][Archery][Integration] Remove debug prints
#44140 merged
Sep 17, 2024 -
GH-44062: [Dev][Archery][Integration] Reduce needless test matrix
#44099 merged
Sep 16, 2024 -
GH-44127: [CI][R] Fix util_enable_core_dumps.sh path
#44128 merged
Sep 16, 2024 -
GH-43518: [Python][Packaging][CI] Drop Python 3.8 support
#43970 merged
Sep 16, 2024 -
GH-44111: [CI][Python] Enable S3 tests on macOS CI
#44129 merged
Sep 16, 2024 -
MINOR: [Archery] Fix typo on docker CI comment
#44130 merged
Sep 16, 2024 -
GH-44085: [CI][R] Update Ubuntu version for R force test
#44087 merged
Sep 16, 2024 -
GH-43267: [C#] Correctly import sliced arrays through the C Data interface
#44117 merged
Sep 16, 2024 -
GH-44122: [R] Don't use the new pipe yet
#44123 merged
Sep 15, 2024 -
GH-44007: [GLib][Parquet] Add
gparquet_arrow_file_writer_new_buffered_row_group()
#44100 merged
Sep 15, 2024
14 Pull requests opened by 11 people
-
GH-44114: [R] Add Rocky and opensuse to the allowlist for libarrow binaries
#44124 opened
Sep 15, 2024 -
GH-44125: [Python] Add concat_recordbatches function
#44126 opened
Sep 15, 2024 -
Target .NET Framework 4.7.2
#44133 opened
Sep 16, 2024 -
GH-43876: [CI][Swift] Use apache/arrow-go remote repository instead of arrow/go subfolder
#44145 opened
Sep 17, 2024 -
GH-43846: [Python][Packaging] Remove numpy dependency from pyarrow packaging
#44148 opened
Sep 17, 2024 -
GH-40570: [CI] Default environment to Ubuntu 22.04 instead of 20.04
#44151 opened
Sep 17, 2024 -
GH-40653: [C++] Avoid running more tasks in `~SerialExecutor`
#44162 opened
Sep 18, 2024 -
GH-44167: [C++][Acero] Add more row segmenter tests
#44166 opened
Sep 19, 2024 -
GH-44168: [Python][Acero] Provide method to perform aggregations with acero for datasets
#44169 opened
Sep 19, 2024 -
MINOR: [Python] fix pandas_compat.py
#44171 opened
Sep 19, 2024 -
GH-43878: [Go][Release] Remove Go related codes from our release scripts
#44172 opened
Sep 19, 2024 -
GH-33618: [C++] Refactor LocalFileSystem internals to improve recursive listing performance
#44176 opened
Sep 20, 2024 -
GH-44158: [Archery][Integration] Add more explanation how --target-implementations works
#44177 opened
Sep 21, 2024 -
GH-44178: [GLib][Ruby][Flight] Allow setting CallOption timeout
#44179 opened
Sep 21, 2024
58 Issues closed by 16 people
-
[R] stringr binding for `str_sub()` silently mishandles negative start/stop values
#43960 closed
Sep 21, 2024 -
[GLib] Add GArrowStreamDecoder
#40493 closed
Sep 20, 2024 -
[Java] Add RunEndEncodingVector
#39982 closed
Sep 20, 2024 -
[C++][Parquet] Add support for arrow::ArrayStatistics: boolean
#44008 closed
Sep 19, 2024 -
[R] S3 support not enabled for r-universe builds on MacOS
#43030 closed
Sep 19, 2024 -
[C#] Use "asf" organization in NuGet
#43930 closed
Sep 19, 2024 -
Binary View Variadic Buffer Sizes are int64_t but variadic buffers only support int32_t
#44165 closed
Sep 19, 2024 -
[Archery][Integration] Rename "language" to "implementation"
#44155 closed
Sep 19, 2024 -
[Go][CI] Remove Go related test CI except integration test
#43873 closed
Sep 19, 2024 -
[C++][Python] write_csv / WriteCSV sometimes duplicates header
#37903 closed
Sep 18, 2024 -
[C++] ABI break in patch release 15.0.1
#40604 closed
Sep 18, 2024 -
[Python] Create simple HTTP server example using FastAPI
#40869 closed
Sep 18, 2024 -
[C++] Add `bool` operator support to `arrow::Result`
#42036 closed
Sep 18, 2024 -
[Python] OSError: Expected to be able to read 1013159856 bytes for message body, got 513144496
#44146 closed
Sep 18, 2024 -
[Python] Build wheels for the 3.13 free-threaded build
#43964 closed
Sep 18, 2024 -
[C++][Compute] Row segmenter inefficiency
#44052 closed
Sep 18, 2024 -
[CI][C++][Go] Don't run jobs needs arm64 self-hosted runner on fork
#34975 closed
Sep 18, 2024 -
[GLib][FlightRPC] ArrowFlight: invalid "closure" annotation: only valid on callback parameters
#44153 closed
Sep 18, 2024 -
[Python][Parquet][Emscripten] PARQUET_TEST_DATA isn't set in test-conda-python-emscripten
#43868 closed
Sep 18, 2024 -
[Go][CI] Use apache/arrow-go for integration test
#43874 closed
Sep 18, 2024 -
[Go][CI] Remove Go related lint configurations
#43875 closed
Sep 18, 2024 -
[R] R arrow column selection bug with tidyselect
#44138 closed
Sep 17, 2024 -
[Format][Docs] Document IPC Compression
#37756 closed
Sep 17, 2024 -
[Docs] Update extension type examples to not use UUID
#43809 closed
Sep 17, 2024 -
[Packaging] Remove unnecessary references to Ubuntu bionic
#44149 closed
Sep 17, 2024 -
[C++][Compute] `_mm256_set_m128i()` is unavailable on OpenSUSE 15.1 (GCC 7.5)
#44098 closed
Sep 17, 2024 -
[CI][GLib][FlightRPC] Crash in `gaflight::ServerCustomAuthHandler::Authenticate`
#37114 closed
Sep 17, 2024 -
[Dev][Archery][Integration] Reduce needless test matrix
#44062 closed
Sep 16, 2024 -
[CI][R] r-binary-packages failed with wrong util_enable_core_dumps.sh path
#44127 closed
Sep 16, 2024 -
[Python][Packaging] Drop Python 3.8 support
#43518 closed
Sep 16, 2024 -
[C++][Parquet] Incorporate DELTA_BINARY_PACKED value encoder into library and add unit tests
#42394 closed
Sep 16, 2024 -
[Python][CI] Enable S3 tests on macOS builds
#44111 closed
Sep 16, 2024 -
[R] Add Raspberry Pi CI build to nightlies
#30016 closed
Sep 16, 2024 -
[C++][CI] Revisit the flaky S3 tests caused by recent minio
#24340 closed
Sep 16, 2024 -
[CI] Update the conda docker images to use miniforge instead of miniconda
#26242 closed
Sep 16, 2024 -
[CI] Fix CMake Error: Unknown argument -isystem
#28522 closed
Sep 16, 2024 -
[CI] Docker push is failing with authentication fail
#18785 closed
Sep 16, 2024 -
[CI] Docker Push step authentication is failing
#29544 closed
Sep 16, 2024 -
[C++] [CI] macOS failures on githubactions runs
#18874 closed
Sep 16, 2024 -
[CI][Python] Ability to include pip packages in the conda environments
#30489 closed
Sep 16, 2024 -
[CI] AppVeyor build occationally fails due to file permissions
#30568 closed
Sep 16, 2024 -
[CI] test-conda-cpp-valgrind nightly build doesn't finish
#30839 closed
Sep 16, 2024 -
[CI][Gandiva] Travis osx nightly build is failing due to homebrew llvm upgrade
#26338 closed
Sep 16, 2024 -
[CI] Docker push fails on Travis-CI
#30541 closed
Sep 16, 2024 -
[CI][Java][Flight] Test failure on s390x
#31617 closed
Sep 16, 2024 -
[Python] test_filesystem_dataset_no_filesystem_interaction segfault on s390x
#32410 closed
Sep 16, 2024 -
[C++] arrow-compute-asof-join-node-test crashes on s390x
#33476 closed
Sep 16, 2024 -
[CI] Migrate jobs on Travis CI to dev/tasks/
#20496 closed
Sep 16, 2024 -
[CI] Travis-CI should not run on unrelated changes
#33135 closed
Sep 16, 2024 -
[CI][C++][Python] s390x builds time out
#32951 closed
Sep 16, 2024 -
[CI][Python][C++] Avoid compiling grpcio from source
#33630 closed
Sep 16, 2024 -
[CI][C++][Python] Dump C/C++ stack trace on crashes/core dumps
#41209 closed
Sep 16, 2024 -
[C++][Parquet][CI] DatasetEncryptionTest.WriteReadDatasetWithEncryption failed
#41710 closed
Sep 16, 2024 -
[CI][R] Upgrade AMD64 Ubuntu 20.04 R 4.4 Force-Tests true to use newer Ubuntu
#44085 closed
Sep 16, 2024 -
[C#] C Data Interface import computes incorrect buffer sizes when offset is non-zero
#43267 closed
Sep 16, 2024 -
[R] Don't use the new pipe yet
#44122 closed
Sep 15, 2024 -
[GLib][Parquet] Add `gparquet_arrow_file_writer_new_buffered_row_group()`
#44007 closed
Sep 15, 2024
19 Issues opened by 18 people
-
TypesScript type error in ts/builder/map.ts causes TypeScript compilation to fail
#44180 opened
Sep 21, 2024 -
[GLib][Ruby][Flight] allow setting CallOption timeout
#44178 opened
Sep 21, 2024 -
[R] Support integer date and time classes from data.table
#44174 opened
Sep 19, 2024 -
[CI][Python] Some verification tasks on Ubuntu 20.04 fail because we have dropped support for Python 3.8
#44173 opened
Sep 19, 2024 -
[Python][Acero] Provide method to perform aggregations with acero for datasets
#44168 opened
Sep 19, 2024 -
[C++][Acero] More tests for row segmenter
#44167 opened
Sep 19, 2024 -
[Python][C++] Offset Overflow when calling combine_chunks on Large Struct Arrays
#44164 opened
Sep 18, 2024 -
[C++] DecimalRealConversion could multiply by 5 instead of 10
#44161 opened
Sep 18, 2024 -
[Python] [FlightRPC] Index with value of 0 is out-of-bounds for array of length 0
#44160 opened
Sep 18, 2024 -
[Archery][Integration] Add more explanation how `--target-implementations` works
#44158 opened
Sep 18, 2024 -
[Java] Make RangeEqualsVisitor of RunEndEncodedVector more efficient
#44157 opened
Sep 18, 2024 -
[Release][Go] Remove the Go related parts from the verification script
#44152 opened
Sep 18, 2024 -
[Python] Memory leak when using engine="pyarrow" while reading csv files via pd.read_csv()
#44147 opened
Sep 17, 2024 -
[C#] Support compressed RecordBatches via Flight
#44137 opened
Sep 16, 2024 -
[JavaScript] Can't create table if it contains an array of strings ("Unable to infer Vector type")
#44136 opened
Sep 16, 2024 -
Apache Arrow Flight Server (Data as a Service)
#44135 opened
Sep 16, 2024 -
Running async code inside do_exchange
#44132 opened
Sep 16, 2024 -
[Python] Bump minimum required versions of numpy and pandas
#44131 opened
Sep 16, 2024 -
[Python] There is no API to concatenate recordbatches
#44125 opened
Sep 15, 2024
66 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
GH-43956: [C++][Format] Add initial Decimal32/Decimal64 implementations
#43957 commented on
Sep 20, 2024 • 105 new comments -
GH-43631: [C][Format] Add ArrowAsyncDeviceStreamHandler interface
#43632 commented on
Sep 21, 2024 • 62 new comments -
Proof-of-concept Parquet GEOMETRY logical type implementation
#43977 commented on
Sep 20, 2024 • 21 new comments -
GH-44088: [Java] Fix copyFrom in BaseVariableWidthViewVector
#44078 commented on
Sep 20, 2024 • 14 new comments -
GH-43911: [C++] Compute Row: ListKeyEncoder Supports
#43912 commented on
Sep 18, 2024 • 13 new comments -
GH-43535: [C++] support the AWS S3 SSE-C encryption
#43601 commented on
Sep 21, 2024 • 11 new comments -
GH-43589: [Python] Add bindings for Buffer copy() method to other device
#43590 commented on
Sep 20, 2024 • 10 new comments -
GH-40343: [C++] Move S3FileSystem to the registry
#41559 commented on
Sep 20, 2024 • 8 new comments -
GH-36954: [Python] Add more FlightInfo / FlightEndpoint attributes
#43537 commented on
Sep 18, 2024 • 6 new comments -
GH-41551: [C++] Remove boost::filesystem in favor of std::filesystem
#44005 commented on
Sep 18, 2024 • 1 new comment -
GH-43994: [C++][Parquet] Fix schema conversion from two-level encoding nested list
#43995 commented on
Sep 20, 2024 • 1 new comment -
GH-44055: [Java] Finalize ErrorProne Warnings to be considered as Errors
#44056 commented on
Sep 18, 2024 • 1 new comment -
GH-41110: [C#] Handle empty stream in ArrowStreamReaderImplementation
#43939 commented on
Sep 19, 2024 • 1 new comment -
GH-41673: [Format][Docs] Add arrow format introductory page
#41593 commented on
Sep 18, 2024 • 1 new comment -
GH-43680: [Integration] Unskip nanoarrow in IPC integration tests
#43715 commented on
Sep 20, 2024 • 1 new comment -
GH-44071: [C++] Leak S3 structures if finalization happens too late
#44090 commented on
Sep 19, 2024 • 1 new comment -
GH-41310: [C++] S3FS Read file using the version id obtained with HEAD call
#41311 commented on
Sep 18, 2024 • 0 new comments -
GH-41891: [C++] Clean up implicit fallthrough warnings
#41892 commented on
Sep 19, 2024 • 0 new comments -
GH-40547: [R][Docs] Add a non-technical introductory R vignette to the functioning of arrow
#40982 commented on
Sep 16, 2024 • 0 new comments -
GH-40592: [C++][Parquet] Implement SizeStatistics
#40594 commented on
Sep 20, 2024 • 0 new comments -
[Java] Remove Java 11 support
#43307 commented on
Sep 20, 2024 • 0 new comments -
[Python] PyArrow Table to Pandas int8 conversion issue
#40815 commented on
Sep 20, 2024 • 0 new comments -
[Python] Reading Hive-style partitioned parquet files from GCS
#30481 commented on
Sep 20, 2024 • 0 new comments -
[Java] Add RunEndEncoding format support
#39015 commented on
Sep 20, 2024 • 0 new comments -
GH-41094: [C++] Support scalar expression special form
#42106 commented on
Sep 18, 2024 • 0 new comments -
MINOR: [CI] Bump alpine version
#43354 commented on
Sep 18, 2024 • 0 new comments -
GH-43440: [R] Unable to filter a factor column with %in%
#43446 commented on
Sep 21, 2024 • 0 new comments -
GH-43410: [Python] Support Arrow PyCapsule stream objects in write_dataset
#43771 commented on
Sep 18, 2024 • 0 new comments -
GH-33592: [C++] support casting nullable fields to non-nullable if there are no null values
#43782 commented on
Sep 19, 2024 • 0 new comments -
GH-43953: [C++] Add tests based on random data and benchmarks to ChunkResolver::ResolveMany
#43954 commented on
Sep 19, 2024 • 0 new comments -
GH-44066: [Python] Add Python wrapper for JsonExtensionType
#44070 commented on
Sep 17, 2024 • 0 new comments -
GH-41706: [C++][Acero] Modify asof_join_node test to provoke issue
#44083 commented on
Sep 17, 2024 • 0 new comments -
GH-20981: [Flight][Integration] Generic gRPC Test Runner and Server
#44115 commented on
Sep 19, 2024 • 0 new comments -
[C++] Copy bitmap all at once when casting from string-view to offset string and binary types
#43573 commented on
Sep 20, 2024 • 0 new comments -
[C++] write_csv tries to write binary columns as UTF-8 and fails if they're invalid
#41962 commented on
Sep 18, 2024 • 0 new comments -
[Docs] Document conventions for sending and receiving Arrow data over HTTP APIs
#40465 commented on
Sep 18, 2024 • 0 new comments -
Build fails in util/pcre.h due to missing #include <cstdint>
#43350 commented on
Sep 18, 2024 • 0 new comments -
[Python] Support for the free-threaded build of CPython 3.13
#43536 commented on
Sep 18, 2024 • 0 new comments -
[Python][S3] Segfault when using S3FileSystem in uwsgi
#44071 commented on
Sep 18, 2024 • 0 new comments -
[CI] Default environment jobs to Ubuntu 22.04 instead of Ubuntu 20.04
#40570 commented on
Sep 17, 2024 • 0 new comments -
[R] CRAN packaging checklist for version 17.0.0
#43317 commented on
Sep 17, 2024 • 0 new comments -
[Python][Packaging] Remove numpy required dependency from pyarrow packaging
#43846 commented on
Sep 17, 2024 • 0 new comments -
[Python] BUG: Process hangs indefinitely on UnicodeDecodeError When use_threads=True in pyarrow.csv.read_csv
#43892 commented on
Sep 17, 2024 • 0 new comments -
[Swift] Use apache/arrow-go
#43876 commented on
Sep 17, 2024 • 0 new comments -
[Python] Implement pa.array() with type=union type
#19157 commented on
Sep 16, 2024 • 0 new comments -
join_asof out-of-order error for big sorted tables
#41706 commented on
Sep 16, 2024 • 0 new comments -
[CI][Dev] Add shell script linter
#43080 commented on
Sep 16, 2024 • 0 new comments -
[CI][Python][C++] Support on Power Architecture
#43817 commented on
Sep 16, 2024 • 0 new comments -
[Python][CI] Tests involving fastparquet are never run
#37853 commented on
Sep 16, 2024 • 0 new comments -
[R] Add Rocky and opensuse to the allowlist for libarrow binaries
#44114 commented on
Sep 15, 2024 • 0 new comments -
pa.compute.sum result for decimal128 doesn't fit into precision/scale
#35166 commented on
Sep 19, 2024 • 0 new comments -
[C++][Parquet] Add support for arrow::ArrayStatistics
#43549 commented on
Sep 19, 2024 • 0 new comments -
[Python][S3] Segmentation fault when running multithreading in Docker
#39703 commented on
Sep 19, 2024 • 0 new comments -
[Go][Release] Remove Go related codes from our release scripts
#43878 commented on
Sep 19, 2024 • 0 new comments -
[Python] Cannot read data if endpoint is s3 on a "secure" Minio server
#40754 commented on
Sep 19, 2024 • 0 new comments -
[Python] Reading partial data/first block hangs on some cloud filesystems
#43497 commented on
Sep 18, 2024 • 0 new comments -
[C++] std::aligned_storage is deprecated since C++23
#41536 commented on
Sep 18, 2024 • 0 new comments -
[CI][C++] Fix arrow-s3fs-test timeouts on macOS C++ job
#40410 commented on
Sep 18, 2024 • 0 new comments -
[C++] Rename the fixed-width concept from fixed_width_internal.h to "generalized fixed-width"
#41963 commented on
Sep 18, 2024 • 0 new comments -
[C++] Stream write support in ArrayBuilders
#41206 commented on
Sep 18, 2024 • 0 new comments -
[C++] Lengthy destruction of ScannerRecordBatchReader
#40653 commented on
Sep 18, 2024 • 0 new comments -
[CI][Python] Consider installing `azurite` and `minio` for Mac OS python tests
#40509 commented on
Sep 18, 2024 • 0 new comments -
[C++][Python] Duplicate csv header when table batches start with empty
#36889 commented on
Sep 18, 2024 • 0 new comments -
[C++] Move fsspec FileSystem to a separate module
#40344 commented on
Sep 18, 2024 • 0 new comments -
[C++][Parquet] Build fails on illumos
#41338 commented on
Sep 18, 2024 • 0 new comments -
Possibility of declaring RandomAccessFile::GetStream as a virtual function?
#41616 commented on
Sep 18, 2024 • 0 new comments