blob: f2408c906bbdc1a452ec6de741aa45cad93edef4 [file] [view]
<!--
Licensed to the Apache Software Foundation (ASF) under one
or more contributor license agreements. See the NOTICE file
distributed with this work for additional information
regarding copyright ownership. The ASF licenses this file
to you under the Apache License, Version 2.0 (the
"License"); you may not use this file except in compliance
with the License. You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing,
software distributed under the License is distributed on an
"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
KIND, either express or implied. See the License for the
specific language governing permissions and limitations
under the License.
-->
# SessionContext
`SessionContext` is the entry point into DataFusion from Java. It owns the
catalog of registered tables and the query planner.
## Lifecycle
```java
try (SessionContext ctx = new SessionContext()) {
// register tables, build queries...
}
```
`SessionContext` is `AutoCloseable`. Closing it releases the underlying
native context. Use `try`-with-resources so the native side is freed even
on exception.
## Threading
A `SessionContext` is **not thread-safe**. Do not share one across threads
without external synchronization. The simplest pattern is one context per
thread.
## Configuration
`SessionContext.builder()` exposes a fluent builder for overriding
DataFusion defaults — batch size, target partitions, statistics
collection, information schema, memory pool size, and the spill
directory. See the
<!-- Raw HTML link: MyST resolves relative .html Markdown links as
cross-references, which fails sphinx-build -W. The Javadoc tree is
copied verbatim via html_extra_path and is unknown to Sphinx. -->
<a href="../api/org/apache/datafusion/SessionContextBuilder.html"><code>SessionContextBuilder</code></a>
Javadoc for the full list.
```java
try (SessionContext ctx = SessionContext.builder()
.batchSize(4096)
.targetPartitions(8)
.build()) {
// ...
}
```
## Spark-compatible functions
`withSparkFunctions()` registers Apache Spark–compatible functions and
expression planners (from the
[`datafusion-spark`](https://crates.io/crates/datafusion-spark) crate) on the
context:
```java
try (SessionContext ctx = SessionContext.builder()
.withSparkFunctions()
.build();
DataFrame df = ctx.sql("SELECT crc32('Spark')")) {
// ...
}
```
When enabled, Spark-compatible functions override any DataFusion built-in of
the same name. This requires the native library to be built with the `spark`
Cargo feature, which is enabled in the default build; otherwise `build()`
throws explaining the feature is missing.