| <!DOCTYPE html> |
| <!--[if IE]><![endif]--> |
| <html> |
| |
| <head> |
| <meta charset="utf-8"> |
| <meta http-equiv="X-UA-Compatible" content="IE=edge,chrome=1"> |
| <title>Class HMMChineseTokenizerFactory |
| | Apache Lucene.NET 4.8.0-beta00010 Documentation </title> |
| <meta name="viewport" content="width=device-width"> |
| <meta name="title" content="Class HMMChineseTokenizerFactory |
| | Apache Lucene.NET 4.8.0-beta00010 Documentation "> |
| <meta name="generator" content="docfx 2.56.0.0"> |
| |
| <link rel="shortcut icon" href="https://lucenenet.apache.org/docs/4.8.0-beta00009/logo/favicon.ico"> |
| <link rel="stylesheet" href="https://lucenenet.apache.org/docs/4.8.0-beta00009/styles/docfx.vendor.css"> |
| <link rel="stylesheet" href="https://lucenenet.apache.org/docs/4.8.0-beta00009/styles/docfx.css"> |
| <link rel="stylesheet" href="https://lucenenet.apache.org/docs/4.8.0-beta00009/styles/main.css"> |
| <meta property="docfx:navrel" content="toc.html"> |
| <meta property="docfx:tocrel" content="analysis-smartcn/toc.html"> |
| |
| <meta property="docfx:rel" content="https://lucenenet.apache.org/docs/4.8.0-beta00009/"> |
| |
| </head> |
| <body data-spy="scroll" data-target="#affix" data-offset="120"> |
| <div id="wrapper"> |
| <header> |
| |
| <nav id="autocollapse" class="navbar ng-scope" role="navigation"> |
| <div class="container"> |
| <div class="navbar-header"> |
| <button type="button" class="navbar-toggle" data-toggle="collapse" data-target="#navbar"> |
| <span class="sr-only">Toggle navigation</span> |
| <span class="icon-bar"></span> |
| <span class="icon-bar"></span> |
| <span class="icon-bar"></span> |
| </button> |
| |
| <a class="navbar-brand" href="/"> |
| <img id="logo" class="svg" src="https://lucenenet.apache.org/docs/4.8.0-beta00009/logo/lucene-net-color.png" alt=""> |
| </a> |
| </div> |
| <div class="collapse navbar-collapse" id="navbar"> |
| <form class="navbar-form navbar-right" role="search" id="search"> |
| <div class="form-group"> |
| <input type="text" class="form-control" id="search-query" placeholder="Search" autocomplete="off"> |
| </div> |
| </form> |
| </div> |
| </div> |
| </nav> |
| |
| <div class="subnav navbar navbar-default"> |
| <div class="container hide-when-search"> |
| <ul class="level0 breadcrumb"> |
| <li> |
| <a href="https://lucenenet.apache.org/docs/4.8.0-beta00009/">API</a> |
| <span id="breadcrumb"> |
| <ul class="breadcrumb"> |
| <li></li> |
| </ul> |
| </span> |
| </li> |
| </ul> |
| </div> |
| </div> |
| </header> |
| <div class="container body-content"> |
| |
| <div id="search-results"> |
| <div class="search-list"></div> |
| <div class="sr-items"> |
| <p><i class="glyphicon glyphicon-refresh index-loading"></i></p> |
| </div> |
| <ul id="pagination"></ul> |
| </div> |
| </div> |
| <div role="main" class="container body-content hide-when-search"> |
| |
| <div class="sidenav hide-when-search"> |
| <a class="btn toc-toggle collapse" data-toggle="collapse" href="#sidetoggle" aria-expanded="false" aria-controls="sidetoggle">Show / Hide Table of Contents</a> |
| <div class="sidetoggle collapse" id="sidetoggle"> |
| <div id="sidetoc"></div> |
| </div> |
| </div> |
| <div class="article row grid-right"> |
| <div class="col-md-10"> |
| <article class="content wrap" id="_content" data-uid="Lucene.Net.Analysis.Cn.Smart.HMMChineseTokenizerFactory"> |
| |
| |
| <h1 id="Lucene_Net_Analysis_Cn_Smart_HMMChineseTokenizerFactory" data-uid="Lucene.Net.Analysis.Cn.Smart.HMMChineseTokenizerFactory" class="text-break">Class HMMChineseTokenizerFactory |
| </h1> |
| <div class="markdown level0 summary"><p>Factory for <a class="xref" href="Lucene.Net.Analysis.Cn.Smart.HMMChineseTokenizer.html">HMMChineseTokenizer</a> |
| <p> |
| Note: this class will currently emit tokens for punctuation. So you should either add |
| a <span class="xref">Lucene.Net.Analysis.Miscellaneous.WordDelimiterFilter</span> after to remove these (with concatenate off), or use the |
| SmartChinese stoplist with a StopFilterFactory via:</p> |
| <pre><code>words="org/apache/lucene/analysis/cn/smart/stopwords.txt"</code></pre> |
| <p><p> |
| <div class="lucene-block lucene-experimental">This is a Lucene.NET EXPERIMENTAL API, use at your own risk</div></div> |
| <div class="markdown level0 conceptual"></div> |
| <div class="inheritance"> |
| <h5>Inheritance</h5> |
| <div class="level0"><span class="xref">System.Object</span></div> |
| <div class="level1"><span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory</span></div> |
| <div class="level2"><span class="xref">Lucene.Net.Analysis.Util.TokenizerFactory</span></div> |
| <div class="level3"><span class="xref">HMMChineseTokenizerFactory</span></div> |
| </div> |
| <div class="inheritedMembers"> |
| <h5>Inherited Members</h5> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.TokenizerFactory.ForName(System.String, System.Collections.Generic.IDictionary<System.String, System.String>)</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.TokenizerFactory.LookupClass(System.String)</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.TokenizerFactory.AvailableTokenizers</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.TokenizerFactory.ReloadTokenizers()</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.TokenizerFactory.Create(System.IO.TextReader)</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.LUCENE_MATCH_VERSION_PARAM</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.m_luceneMatchVersion</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.OriginalArgs</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.AssureMatchVersion()</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.LuceneMatchVersion</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.Require(System.Collections.Generic.IDictionary<System.String, System.String>, System.String)</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.Require(System.Collections.Generic.IDictionary<System.String, System.String>, System.String, System.Collections.Generic.ICollection<System.String>)</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.Require(System.Collections.Generic.IDictionary<System.String, System.String>, System.String, System.Collections.Generic.ICollection<System.String>, System.Boolean)</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.Get(System.Collections.Generic.IDictionary<System.String, System.String>, System.String, System.String)</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.Get(System.Collections.Generic.IDictionary<System.String, System.String>, System.String, System.Collections.Generic.ICollection<System.String>)</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.Get(System.Collections.Generic.IDictionary<System.String, System.String>, System.String, System.Collections.Generic.ICollection<System.String>, System.String)</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.Get(System.Collections.Generic.IDictionary<System.String, System.String>, System.String, System.Collections.Generic.ICollection<System.String>, System.String, System.Boolean)</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.RequireInt32(System.Collections.Generic.IDictionary<System.String, System.String>, System.String)</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.GetInt32(System.Collections.Generic.IDictionary<System.String, System.String>, System.String, System.Int32)</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.RequireBoolean(System.Collections.Generic.IDictionary<System.String, System.String>, System.String)</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.GetBoolean(System.Collections.Generic.IDictionary<System.String, System.String>, System.String, System.Boolean)</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.RequireSingle(System.Collections.Generic.IDictionary<System.String, System.String>, System.String)</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.GetSingle(System.Collections.Generic.IDictionary<System.String, System.String>, System.String, System.Single)</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.RequireChar(System.Collections.Generic.IDictionary<System.String, System.String>, System.String)</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.GetChar(System.Collections.Generic.IDictionary<System.String, System.String>, System.String, System.Char)</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.GetSet(System.Collections.Generic.IDictionary<System.String, System.String>, System.String)</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.GetPattern(System.Collections.Generic.IDictionary<System.String, System.String>, System.String)</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.GetCulture(System.Collections.Generic.IDictionary<System.String, System.String>, System.String, System.Globalization.CultureInfo)</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.GetWordSet(Lucene.Net.Analysis.Util.IResourceLoader, System.String, System.Boolean)</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.GetLines(Lucene.Net.Analysis.Util.IResourceLoader, System.String)</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.GetSnowballWordSet(Lucene.Net.Analysis.Util.IResourceLoader, System.String, System.Boolean)</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.SplitFileNames(System.String)</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.GetClassArg()</span> |
| </div> |
| <div> |
| <span class="xref">Lucene.Net.Analysis.Util.AbstractAnalysisFactory.IsExplicitLuceneMatchVersion</span> |
| </div> |
| <div> |
| <span class="xref">System.Object.Equals(System.Object)</span> |
| </div> |
| <div> |
| <span class="xref">System.Object.Equals(System.Object, System.Object)</span> |
| </div> |
| <div> |
| <span class="xref">System.Object.GetHashCode()</span> |
| </div> |
| <div> |
| <span class="xref">System.Object.GetType()</span> |
| </div> |
| <div> |
| <span class="xref">System.Object.MemberwiseClone()</span> |
| </div> |
| <div> |
| <span class="xref">System.Object.ReferenceEquals(System.Object, System.Object)</span> |
| </div> |
| <div> |
| <span class="xref">System.Object.ToString()</span> |
| </div> |
| </div> |
| <h6><strong>Namespace</strong>: <a class="xref" href="Lucene.Net.Analysis.Cn.Smart.html">Lucene.Net.Analysis.Cn.Smart</a></h6> |
| <h6><strong>Assembly</strong>: Lucene.Net.Analysis.SmartCn.dll</h6> |
| <h5 id="Lucene_Net_Analysis_Cn_Smart_HMMChineseTokenizerFactory_syntax">Syntax</h5> |
| <div class="codewrapper"> |
| <pre><code class="lang-csharp hljs">public sealed class HMMChineseTokenizerFactory : TokenizerFactory</code></pre> |
| </div> |
| <h3 id="constructors">Constructors |
| </h3> |
| <span class="small pull-right mobile-hide"> |
| <span class="divider">|</span> |
| <a href="https://github.com/apache/lucenenet/new/docs/4.8.0-beta00010/websites/apidocs/apiSpec/new?filename=Lucene_Net_Analysis_Cn_Smart_HMMChineseTokenizerFactory__ctor_System_Collections_Generic_IDictionary_System_String_System_String__.md&value=---%0Auid%3A%20Lucene.Net.Analysis.Cn.Smart.HMMChineseTokenizerFactory.%23ctor(System.Collections.Generic.IDictionary%7BSystem.String%2CSystem.String%7D)%0Asummary%3A%20'*You%20can%20override%20summary%20for%20the%20API%20here%20using%20*MARKDOWN*%20syntax'%0A---%0A%0A*Please%20type%20below%20more%20information%20about%20this%20API%3A*%0A%0A">Improve this Doc</a> |
| </span> |
| <span class="small pull-right mobile-hide"> |
| <a href="https://github.com/NightOwl888/lucenenet/blob/release/Lucene.Net_4_8_0_beta00010/src/Lucene.Net.Analysis.SmartCn/HMMChineseTokenizerFactory.cs/#L42">View Source</a> |
| </span> |
| <a id="Lucene_Net_Analysis_Cn_Smart_HMMChineseTokenizerFactory__ctor_" data-uid="Lucene.Net.Analysis.Cn.Smart.HMMChineseTokenizerFactory.#ctor*"></a> |
| <h4 id="Lucene_Net_Analysis_Cn_Smart_HMMChineseTokenizerFactory__ctor_System_Collections_Generic_IDictionary_System_String_System_String__" data-uid="Lucene.Net.Analysis.Cn.Smart.HMMChineseTokenizerFactory.#ctor(System.Collections.Generic.IDictionary{System.String,System.String})">HMMChineseTokenizerFactory(IDictionary<String, String>)</h4> |
| <div class="markdown level1 summary"><p>Creates a new <a class="xref" href="Lucene.Net.Analysis.Cn.Smart.HMMChineseTokenizerFactory.html">HMMChineseTokenizerFactory</a> </p> |
| </div> |
| <div class="markdown level1 conceptual"></div> |
| <h5 class="decalaration">Declaration</h5> |
| <div class="codewrapper"> |
| <pre><code class="lang-csharp hljs">public HMMChineseTokenizerFactory(IDictionary<string, string> args)</code></pre> |
| </div> |
| <h5 class="parameters">Parameters</h5> |
| <table class="table table-bordered table-striped table-condensed"> |
| <thead> |
| <tr> |
| <th>Type</th> |
| <th>Name</th> |
| <th>Description</th> |
| </tr> |
| </thead> |
| <tbody> |
| <tr> |
| <td><span class="xref">System.Collections.Generic.IDictionary</span><<span class="xref">System.String</span>, <span class="xref">System.String</span>></td> |
| <td><span class="parametername">args</span></td> |
| <td></td> |
| </tr> |
| </tbody> |
| </table> |
| <h3 id="methods">Methods |
| </h3> |
| <span class="small pull-right mobile-hide"> |
| <span class="divider">|</span> |
| <a href="https://github.com/apache/lucenenet/new/docs/4.8.0-beta00010/websites/apidocs/apiSpec/new?filename=Lucene_Net_Analysis_Cn_Smart_HMMChineseTokenizerFactory_Create_Lucene_Net_Util_AttributeSource_AttributeFactory_System_IO_TextReader_.md&value=---%0Auid%3A%20Lucene.Net.Analysis.Cn.Smart.HMMChineseTokenizerFactory.Create(Lucene.Net.Util.AttributeSource.AttributeFactory%2CSystem.IO.TextReader)%0Asummary%3A%20'*You%20can%20override%20summary%20for%20the%20API%20here%20using%20*MARKDOWN*%20syntax'%0A---%0A%0A*Please%20type%20below%20more%20information%20about%20this%20API%3A*%0A%0A">Improve this Doc</a> |
| </span> |
| <span class="small pull-right mobile-hide"> |
| <a href="https://github.com/NightOwl888/lucenenet/blob/release/Lucene.Net_4_8_0_beta00010/src/Lucene.Net.Analysis.SmartCn/HMMChineseTokenizerFactory.cs/#L51">View Source</a> |
| </span> |
| <a id="Lucene_Net_Analysis_Cn_Smart_HMMChineseTokenizerFactory_Create_" data-uid="Lucene.Net.Analysis.Cn.Smart.HMMChineseTokenizerFactory.Create*"></a> |
| <h4 id="Lucene_Net_Analysis_Cn_Smart_HMMChineseTokenizerFactory_Create_Lucene_Net_Util_AttributeSource_AttributeFactory_System_IO_TextReader_" data-uid="Lucene.Net.Analysis.Cn.Smart.HMMChineseTokenizerFactory.Create(Lucene.Net.Util.AttributeSource.AttributeFactory,System.IO.TextReader)">Create(AttributeSource.AttributeFactory, TextReader)</h4> |
| <div class="markdown level1 summary"></div> |
| <div class="markdown level1 conceptual"></div> |
| <h5 class="decalaration">Declaration</h5> |
| <div class="codewrapper"> |
| <pre><code class="lang-csharp hljs">public override Tokenizer Create(AttributeSource.AttributeFactory factory, TextReader reader)</code></pre> |
| </div> |
| <h5 class="parameters">Parameters</h5> |
| <table class="table table-bordered table-striped table-condensed"> |
| <thead> |
| <tr> |
| <th>Type</th> |
| <th>Name</th> |
| <th>Description</th> |
| </tr> |
| </thead> |
| <tbody> |
| <tr> |
| <td><span class="xref">Lucene.Net.Util.AttributeSource.AttributeFactory</span></td> |
| <td><span class="parametername">factory</span></td> |
| <td></td> |
| </tr> |
| <tr> |
| <td><span class="xref">System.IO.TextReader</span></td> |
| <td><span class="parametername">reader</span></td> |
| <td></td> |
| </tr> |
| </tbody> |
| </table> |
| <h5 class="returns">Returns</h5> |
| <table class="table table-bordered table-striped table-condensed"> |
| <thead> |
| <tr> |
| <th>Type</th> |
| <th>Description</th> |
| </tr> |
| </thead> |
| <tbody> |
| <tr> |
| <td><span class="xref">Lucene.Net.Analysis.Tokenizer</span></td> |
| <td></td> |
| </tr> |
| </tbody> |
| </table> |
| <h5 class="overrides">Overrides</h5> |
| <div><span class="xref">Lucene.Net.Analysis.Util.TokenizerFactory.Create(Lucene.Net.Util.AttributeSource.AttributeFactory, System.IO.TextReader)</span></div> |
| </article> |
| </div> |
| |
| <div class="hidden-sm col-md-2" role="complementary"> |
| <div class="sideaffix"> |
| <div class="contribution"> |
| <ul class="nav"> |
| <li> |
| <a href="https://github.com/apache/lucenenet/new/docs/4.8.0-beta00010/websites/apidocs/apiSpec/new?filename=Lucene_Net_Analysis_Cn_Smart_HMMChineseTokenizerFactory.md&value=---%0Auid%3A%20Lucene.Net.Analysis.Cn.Smart.HMMChineseTokenizerFactory%0Asummary%3A%20'*You%20can%20override%20summary%20for%20the%20API%20here%20using%20*MARKDOWN*%20syntax'%0A---%0A%0A*Please%20type%20below%20more%20information%20about%20this%20API%3A*%0A%0A" class="contribution-link">Improve this Doc</a> |
| </li> |
| <li> |
| <a href="https://github.com/apache/lucenenet/blob/release/Lucene.Net_4_8_0_beta00010/src/Lucene.Net.Analysis.SmartCn/HMMChineseTokenizerFactory.cs/#L37" class="contribution-link">View Source</a> |
| </li> |
| </ul> |
| </div> |
| <nav class="bs-docs-sidebar hidden-print hidden-xs hidden-sm affix" id="affix"> |
| <!-- <p><a class="back-to-top" href="#top">Back to top</a><p> --> |
| </nav> |
| </div> |
| </div> |
| </div> |
| </div> |
| |
| <footer> |
| <div class="grad-bottom"></div> |
| <div class="footer"> |
| <div class="container"> |
| <span class="pull-right"> |
| <a href="#top">Back to top</a> |
| </span> |
| Copyright © 2020 Licensed to the Apache Software Foundation (ASF) |
| |
| </div> |
| </div> |
| </footer> |
| </div> |
| |
| <script type="text/javascript" src="https://lucenenet.apache.org/docs/4.8.0-beta00009/styles/docfx.vendor.js"></script> |
| <script type="text/javascript" src="https://lucenenet.apache.org/docs/4.8.0-beta00009/styles/docfx.js"></script> |
| <script type="text/javascript" src="https://lucenenet.apache.org/docs/4.8.0-beta00009/styles/main.js"></script> |
| </body> |
| </html> |