blob: d2592e78133205d65b159e3d1eb03a2133e3031e [file]
<!DOCTYPE HTML>
<html lang>
<head>
<!-- Generated by javadoc (17) on Sat Jun 13 20:43:56 UTC 2026 -->
<title>TableProvider (Apache DataFusion Java 0.2.0-SNAPSHOT)</title>
<meta name="viewport" content="width=device-width, initial-scale=1">
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
<meta name="dc.created" content="2026-06-13">
<meta name="description" content="declaration: package: org.apache.datafusion, interface: TableProvider">
<meta name="generator" content="javadoc/ClassWriterImpl">
<link rel="stylesheet" type="text/css" href="../../../stylesheet.css" title="Style">
<link rel="stylesheet" type="text/css" href="../../../script-dir/jquery-ui.min.css" title="Style">
<link rel="stylesheet" type="text/css" href="../../../jquery-ui.overrides.css" title="Style">
<script type="text/javascript" src="../../../script.js"></script>
<script type="text/javascript" src="../../../script-dir/jquery-3.7.1.min.js"></script>
<script type="text/javascript" src="../../../script-dir/jquery-ui.min.js"></script>
</head>
<body class="class-declaration-page">
<script type="text/javascript">var evenRowColor = "even-row-color";
var oddRowColor = "odd-row-color";
var tableTab = "table-tab";
var activeTableTab = "active-table-tab";
var pathtoroot = "../../../";
loadScripts(document, 'script');</script>
<noscript>
<div>JavaScript is disabled on your browser.</div>
</noscript>
<div class="flex-box">
<header role="banner" class="flex-header">
<nav role="navigation">
<!-- ========= START OF TOP NAVBAR ======= -->
<div class="top-nav" id="navbar-top">
<div class="skip-nav"><a href="#skip-navbar-top" title="Skip navigation links">Skip navigation links</a></div>
<ul id="navbar-top-firstrow" class="nav-list" title="Navigation">
<li><a href="../../../index.html">Overview</a></li>
<li><a href="package-summary.html">Package</a></li>
<li class="nav-bar-cell1-rev">Class</li>
<li><a href="class-use/TableProvider.html">Use</a></li>
<li><a href="package-tree.html">Tree</a></li>
<li><a href="../../../index-all.html">Index</a></li>
<li><a href="../../../help-doc.html#class">Help</a></li>
</ul>
</div>
<div class="sub-nav">
<div>
<ul class="sub-nav-list">
<li>Summary:&nbsp;</li>
<li>Nested&nbsp;|&nbsp;</li>
<li>Field&nbsp;|&nbsp;</li>
<li>Constr&nbsp;|&nbsp;</li>
<li><a href="#method-summary">Method</a></li>
</ul>
<ul class="sub-nav-list">
<li>Detail:&nbsp;</li>
<li>Field&nbsp;|&nbsp;</li>
<li>Constr&nbsp;|&nbsp;</li>
<li><a href="#method-detail">Method</a></li>
</ul>
</div>
<div class="nav-list-search"><label for="search-input">SEARCH:</label>
<input type="text" id="search-input" value="search" disabled="disabled">
<input type="reset" id="reset-button" value="reset" disabled="disabled">
</div>
</div>
<!-- ========= END OF TOP NAVBAR ========= -->
<span class="skip-nav" id="skip-navbar-top"></span></nav>
</header>
<div class="flex-content">
<main role="main">
<!-- ======== START OF CLASS DATA ======== -->
<div class="header">
<div class="sub-title"><span class="package-label-in-type">Package</span>&nbsp;<a href="package-summary.html">org.apache.datafusion</a></div>
<h1 title="Interface TableProvider" class="title">Interface TableProvider</h1>
</div>
<section class="class-description" id="class-description">
<dl class="notes">
<dt>All Known Implementing Classes:</dt>
<dd><code><a href="SimpleTableProvider.html" title="class in org.apache.datafusion">SimpleTableProvider</a></code></dd>
</dl>
<hr>
<div class="type-signature"><span class="modifiers">public interface </span><span class="element-name type-name-label">TableProvider</span></div>
<div class="block">A Java-implemented table that can be registered with a <a href="SessionContext.html" title="class in org.apache.datafusion"><code>SessionContext</code></a> via <a href="SessionContext.html#registerTable(java.lang.String,org.apache.datafusion.TableProvider)"><code>SessionContext.registerTable(String, TableProvider)</code></a>. Mirrors the role of DataFusion's Rust
<code>TableProvider</code> trait, but at present only exposes the methods needed for a full table
scan; future versions may add filter/projection pushdown and multi-partition support as default
methods so existing implementations keep working.
<p><a href="SimpleTableProvider.html" title="class in org.apache.datafusion"><code>SimpleTableProvider</code></a> is a ready-made implementation for the common case of "I have a
schema and a function that returns an <code>ArrowReader</code>".
<p>Each call to <a href="#scan(org.apache.arrow.memory.BufferAllocator)"><code>scan(BufferAllocator)</code></a> must return a fresh, independent <code>ArrowReader</code> so that queries which touch the table more than once (self-joins, <code>UNION ALL</code>,
repeated reads) work correctly. The returned reader is closed by the framework when the stream
ends.
<p>The schema returned by <a href="#schema()"><code>schema()</code></a> is captured once at registration time. Every batch
produced by every <code>ArrowReader</code> returned from <a href="#scan(org.apache.arrow.memory.BufferAllocator)"><code>scan(BufferAllocator)</code></a> must conform
to it; a mismatch fails the query.</div>
</section>
<section class="summary">
<ul class="summary-list">
<!-- ========== METHOD SUMMARY =========== -->
<li>
<section class="method-summary" id="method-summary">
<h2>Method Summary</h2>
<div id="method-summary-table">
<div class="table-tabs" role="tablist" aria-orientation="horizontal"><button id="method-summary-table-tab0" role="tab" aria-selected="true" aria-controls="method-summary-table.tabpanel" tabindex="0" onkeydown="switchTab(event)" onclick="show('method-summary-table', 'method-summary-table', 3)" class="active-table-tab">All Methods</button><button id="method-summary-table-tab2" role="tab" aria-selected="false" aria-controls="method-summary-table.tabpanel" tabindex="-1" onkeydown="switchTab(event)" onclick="show('method-summary-table', 'method-summary-table-tab2', 3)" class="table-tab">Instance Methods</button><button id="method-summary-table-tab3" role="tab" aria-selected="false" aria-controls="method-summary-table.tabpanel" tabindex="-1" onkeydown="switchTab(event)" onclick="show('method-summary-table', 'method-summary-table-tab3', 3)" class="table-tab">Abstract Methods</button></div>
<div id="method-summary-table.tabpanel" role="tabpanel" aria-labelledby="method-summary-table-tab0">
<div class="summary-table three-column-summary">
<div class="table-header col-first">Modifier and Type</div>
<div class="table-header col-second">Method</div>
<div class="table-header col-last">Description</div>
<div class="col-first even-row-color method-summary-table method-summary-table-tab2 method-summary-table-tab3"><code>org.apache.arrow.vector.ipc.ArrowReader</code></div>
<div class="col-second even-row-color method-summary-table method-summary-table-tab2 method-summary-table-tab3"><code><a href="#scan(org.apache.arrow.memory.BufferAllocator)" class="member-name-link">scan</a><wbr>(org.apache.arrow.memory.BufferAllocator&nbsp;allocator)</code></div>
<div class="col-last even-row-color method-summary-table method-summary-table-tab2 method-summary-table-tab3">
<div class="block">Open a fresh batch stream for this table.</div>
</div>
<div class="col-first odd-row-color method-summary-table method-summary-table-tab2 method-summary-table-tab3"><code>org.apache.arrow.vector.types.pojo.Schema</code></div>
<div class="col-second odd-row-color method-summary-table method-summary-table-tab2 method-summary-table-tab3"><code><a href="#schema()" class="member-name-link">schema</a>()</code></div>
<div class="col-last odd-row-color method-summary-table method-summary-table-tab2 method-summary-table-tab3">
<div class="block">The fixed schema of this table.</div>
</div>
</div>
</div>
</div>
</section>
</li>
</ul>
</section>
<section class="details">
<ul class="details-list">
<!-- ============ METHOD DETAIL ========== -->
<li>
<section class="method-details" id="method-detail">
<h2>Method Details</h2>
<ul class="member-list">
<li>
<section class="detail" id="schema()">
<h3>schema</h3>
<div class="member-signature"><span class="return-type">org.apache.arrow.vector.types.pojo.Schema</span>&nbsp;<span class="element-name">schema</span>()</div>
<div class="block">The fixed schema of this table. Called once, at registration time.</div>
</section>
</li>
<li>
<section class="detail" id="scan(org.apache.arrow.memory.BufferAllocator)">
<h3>scan</h3>
<div class="member-signature"><span class="return-type">org.apache.arrow.vector.ipc.ArrowReader</span>&nbsp;<span class="element-name">scan</span><wbr><span class="parameters">(org.apache.arrow.memory.BufferAllocator&nbsp;allocator)</span></div>
<div class="block">Open a fresh batch stream for this table. Called once per physical scan of the table — a single
query may invoke this more than once (self-joins, <code>UNION ALL</code> over the same table, etc.).
<p>Each invocation MUST return an independent <code>ArrowReader</code>. The reader's schema MUST
equal <a href="#schema()"><code>schema()</code></a>. The reader's buffers MUST be allocated from <code>allocator</code> (or from
a child of it) — the framework needs the reader's allocator hierarchy to share a root with the
one it passes here. The allocator contract mirrors the one on <a href="ScalarFunction.html#evaluate(org.apache.arrow.memory.BufferAllocator,org.apache.datafusion.ScalarFunctionArgs)"><code>ScalarFunction.evaluate(org.apache.arrow.memory.BufferAllocator, org.apache.datafusion.ScalarFunctionArgs)</code></a>.</div>
</section>
</li>
</ul>
</section>
</li>
</ul>
</section>
<!-- ========= END OF CLASS DATA ========= -->
</main>
<footer role="contentinfo">
<hr>
<p class="legal-copy"><small>Copyright &#169; 2026. All rights reserved.</small></p>
</footer>
</div>
</div>
</body>
</html>