Add a separate configuration setting for parallelism of scanning parquet files
enhancement
#924
opened Aug 22, 2021 by alamb
Optimize performance of regular expression filtering
enhancement
waiting-on-upstream
#905
opened Aug 18, 2021 by alamb
Optimize count(col) using table statistics
enhancement
performance
#904
opened Aug 18, 2021 by Dandandan
Question: is the combination of limit and predicate push-down safe in ParquetExec?
bug
#900
opened Aug 16, 2021 by andygrove
Track total memory allocation used by DataFusion plans
enhancement
#898
opened Aug 16, 2021 by alamb
Refactor ParquetExec::try_from_files in preparation for making it parallel
enhancement
#896
opened Aug 16, 2021 by andygrove
[DataFusion CLI] Support querying CSV files without providing the schema
enhancement
good first issue
help wanted
#888
opened Aug 15, 2021 by andygrove
Documenting building with simd feature
documentation
good first issue
#882
opened Aug 14, 2021 by andygrove
Write blog post to announce DataFusion 5.0.0 and Ballista 0.5.0
documentation
enhancement
#881
opened Aug 14, 2021 by andygrove
DataFusion should scan Parquet statistics once per query
enhancement
performance
#871
opened Aug 13, 2021 by andygrove
Ballista should serialize Parquet statistics
ballista
enhancement
#868
opened Aug 13, 2021 by andygrove
ParquetExec should parallelize statistics scan operations
enhancement
performance
#867
opened Aug 13, 2021 by andygrove