blob: 34593ec42ff89d98a08dd501fc1a122e268293aa [file] [log] [blame]
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="utf-8">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<meta name="viewport" content="width=device-width, initial-scale=1">
<meta name="description" content="Apache Object Oriented Data Technology (OODT) is the smart way to integrate and archive your processes, your data, and its metadata. It facilitates the generation, processing, management, distribution, analysis of data management, data archiving, and data analytics systems allowing for the integration of data, computation, visualization and other components.
">
<meta name="author" content="Apache OODT">
<title>Apache OODT - Distributed Data Management</title>
<!-- Bootstrap Core CSS -->
<link href="/css/bootstrap.min.css" rel="stylesheet">
<!-- Custom CSS -->
<link href="/css/stylish-portfolio.css" rel="stylesheet">
<!-- Custom Fonts -->
<link href="/font-awesome/css/font-awesome.min.css" rel="stylesheet" type="text/css">
<link href="http://fonts.googleapis.com/css?family=Source+Sans+Pro:300,400,700,300italic,400italic,700italic" rel="stylesheet" type="text/css">
<!-- HTML5 Shim and Respond.js IE8 support of HTML5 elements and media queries -->
<!-- WARNING: Respond.js doesn't work if you view the page via file:// -->
<!--[if lt IE 9]>
<script src="https://oss.maxcdn.com/libs/html5shiv/3.7.0/html5shiv.js"></script>
<script src="https://oss.maxcdn.com/libs/respond.js/1.4.2/respond.min.js"></script>
<![endif]-->
</head>
<body>
<nav class="navbar navbar-default">
<div class="container-fluid">
<div class="navbar-header">
<button type="button" class="navbar-toggle collapsed" data-toggle="collapse" data-target="#bs-example-navbar-collapse-1" aria-expanded="false">
<span class="sr-only">Toggle navigation</span>
<span class="icon-bar"></span>
<span class="icon-bar"></span>
<span class="icon-bar"></span>
</button>
<a class="navbar-brand" href="#"><img src="/img/oodt_asf_logo_sm.png"/></a>
</div>
<div class="collapse navbar-collapse" id="bs-example-navbar-collapse-1">
<ul class="nav navbar-nav">
<!--<a id="menu-close" href="#" class="btn btn-light btn-lg pull-right toggle"><i class="fa fa-times"></i></a>-->
<li>
<a href="/">Home</a>
</li>
<li>
<a href="/blog">News</a>
<li>
<a href="/site_docs/oodt/index.html">Components</a>
</li>
<li>
<a href="/documentation.html">Documentation</a>
</li>
<li>
<a href="/community.html" >Community</a>
</li>
<li>
<a href="/users.html" >Users</a>
</li>
<li>
<a href="/download.html" >Download</a>
</li>
</ul>
</div>
</div>
</nav>
<div class="row col-md-6 col-md-offset-3">
<h1>
Tracking Data Inside Your Data Pool
</h1>
<p>
Storage is cheap, so businesses like to store data, lots of it. This is fine but it adds complexity to the data storage pool, data locality, what data is stored and so on.
What OODT provides is a way for users and administrators to keep track of the data that is available to them and search through sets of data with relative ease.
</p>
<h2>Track data by metadata</h2>
<p>
By putting OODT in front of your staging area or data storage pool, you can use metadata extractors like Apache Tika to extract data from the files you plan to ingest
into your data warehouse. For example, embedded in every Excel spreadsheet will be the creator, the last edited date and so on which can be valuable to users because
it allows them to discover data by metadata instead of by folder or file name. Should you use HDFS or another clustered file system you can also add connectivity to that
file system with relative ease, allowing you instant ingestion and tracking via OODT whilst allowing your NOSQL jobs to run over the content instantly.
</p>
<h2>Solr Analysis</h2>
<p>
By pumping your metadata catalog into Solr you can have instant analysis at your fingertips. Front Solr with Banana and you can ask questions of your metadata, giving
analysts another entry point into your data warehouse.
</p>
</div>
<footer class="sitefooter">
<div class="container">
<div class="row">
<div class="col-lg-10 col-lg-offset-1 text-center">
<a href="http://www.apache.org/events/current-event.html">
<img src="http://www.apache.org/events/current-event-234x60.png"/>
</a>
<h4><strong>Apache OODT</strong>
</h4>
<p>
</p>
<ul class="list-unstyled">
<li><i class="fa fa-envelope-o fa-fw"></i> <a href="mailto:dev@oodt.apache.org">dev@oodt.apache.org</a>
</li>
</ul>
<br>
<ul class="list-inline">
<!-- <li>
<a href="https://facebook.com/"><i class="fa fa-facebook fa-fw fa-3x"></i></a>
</li>-->
<li>
<a href="https://twitter.com/apache_oodt"><i class="fa fa-twitter fa-fw fa-3x"></i></a>
</li>
<li>
<a href="https://github.com/apache/oodt"><i class="fa fa-github fa-fw fa-3x"></i></a>
</li>
</ul>
<hr class="small">
<p class="text-muted">Copyright &copy; The Apache Foundation 2018</p>
</div>
</div>
</div>
</footer>
<!-- jQuery -->
<script src="/js/jquery.js"></script>
<!-- Bootstrap Core JavaScript -->
<script src="/js/bootstrap.min.js"></script>
<!-- Custom Theme JavaScript -->
<script src="/js/custom.js"></script>
</body>
</html>