用Lucene搜索连字符词_Lucene - Fatal编程技术网

用Lucene搜索连字符词

lucene

用Lucene搜索连字符词,lucene,Lucene,我要lucene搜索连字符的单词，例如：节能或“节能”作为一个词因此，如果输入是节能的，标记器会生成如下术语节能或高效或节能或节能因此lucene返回的页面包含“节能”和“节能”，但我希望它返回的页面只包含“节能”页面因此，问题是如何修改standardtokenizer以搜索整个节能单词，而不是将其拆分为单独的单词。使用WhitespaceAnalyzer而不是standardAnalyzer 这将生成仅在空白处分割的标记。但是，检查是否还有其他将要更改的内容。这是我在上的完整博客

我要lucene搜索连字符的单词，例如：节能或“节能”作为一个词

因此，如果输入是节能的，标记器会生成如下术语节能或高效或节能或节能

因此lucene返回的页面包含“节能”和“节能”，但我希望它返回的页面只包含“节能”页面

因此，问题是如何修改standardtokenizer以搜索整个节能单词，而不是将其拆分为单独的单词。

使用

WhitespaceAnalyzer

而不是

standardAnalyzer

这将生成仅在空白处分割的标记。但是，检查是否还有其他将要更改的内容。

这是我在

上的完整博客如果您想在

StandardAnalyzer

中支持连字符，则必须在负责标记化的

StandardTokenizerImpl

中进行更改

标准标记器将连字符的单词一分为二，例如“energy-efficient”标记为energy，efficient。

由于类生成了

StandardTokenizerImpl.java

，其输入文件为

StandardTokenizerImpl.jflex

，您必须在

supplemental.jflex宏

中添加以下行，

StandardTokenizerImpl.jflex

            MidLetterSupp = ( [\u002D]  )

之后，使用jflex生成StandardTokenizerImpl.java并重建索引。

非常感谢！关于这些东西没有太多的文档。我有一个自定义的

分析器

来防止停止字被过滤，所以我现在在分析器中使用

空白标记器

而不是

标准标记器

。但是要注意，当使用

空白标记符时，搜索会区分大小写。所以我必须先用一个小写过滤器来包装它。恐怕不行！例如，空白分析器的名称非常准确：“bubble”被视为与“bubble:”完全不同的标记。对于99%的情况，这没有多大用处。。。我觉得这对手术没用




[visual studio 2017]相关文章推荐



                                                        
Visual studio 2017 .NET标准项目的代码覆盖率
visual-studio-2017 
Visual studio 2017 从VS2017解决方案在TFS2013上执行NuGet还原不起作用
visual-studio-2017 
Visual studio 2017 TFS 2017生成未编译新的.exe
visual-studio-2017 
Visual studio 2017 Fortran间歇式错误
visual-studio-2017fortran 
Visual studio 2017 Visual studio 2017在从netstandard转换为framework后未将csproj视为有效的项目文件
visual-studio-2017 
Visual studio 2017 在寄存器中添加自定义SQL列
visual-studio-2017 
Visual studio 2017 检测到的包版本超出依赖项约束microsoft.aspnetcore.app 2.1.1需要microsoft.netcore.razor.design..”；
visual-studio-2017asp.net-core-mvc 
Visual studio 2017 Google.Ads.GoogleAds NuGet包未自动引用
visual-studio-2017nuget 
                                       





随机文章推荐



                                                        
在Qt4中，有没有一种方法可以从小部件内部知道焦点是否丢失？
qt4 
Qt4 在qt创建者中使用dll
qt4 
Qt4 QMainWindow启动前会出现一个小窗口
qt4 
Qt4 如何在QGraphicsView中安装QGraphicscene
qt4 
Qt4 Qt-Windows 7基本主题下禁用QPushButton的样式表
qt4


                                        

                                        
                                        


                                                
                                                        [lucene]相关推荐
                                                        
使用DBsight-lucene的动词屈折形式？
									Lucene
							 
Lucene评分问题
									Lucene
							 
Lucene 确定哪个值在SOLR多值字段类型中产生了命中
									Lucene
							 
couchdb-lucene的分页结果
									Lucene
							 									Pagination
							 									Couchdb
							 
Lucene 可以使用名称中具有给定前缀的字段进行搜索？
									Lucene
							 
Lucene fieldNorm相似度计算和查询时间值之间的差异
									Lucene
							 									Indexing
							 
Lucene 查找每个相似文档组中的一个文档
									Lucene
							 
子串Lucene分析器
									Lucene
							 
Lucene空白分析器忽略短语？
									Lucene
							 
如何在elasticsearch或lucene中增强基于索引类型的搜索？
									Lucene
							 
Lucene 4.0.0时间范围搜索
									Lucene
							 
无法在lucene中获取搜索文本
									Lucene
							 
Lucene 使用大型数据集时，Jena文本查询性能会显著降低
									Lucene
							 									Rdf
							 									Sparql
							 
Lucene 在kibana中的数组中搜索
									Lucene
							 									Kibana
							 
Alfresco:无法按日期通过Lucene进行搜索
									Lucene
							 									Alfresco
							 
Lucene查询语法-如何添加时间？
									Lucene
							 									Kibana
							 
Lucene sitecore 7.2的高级数据库爬虫问题
									Lucene
							 									Sitecore
							 
Elasticsearch：一个文档中总术语频率的总和
									Lucene
							 
Lucene QueryParser或Query：获取所有有效的必需术语
									Lucene
							 
Elasticsearch inner_在ArrayOutOfBoundsException中命中查询结果
									Lucene
							 
仅当与第一个筛选器不匹配时，Elasticsearch筛选器
									Lucene
							 
Elasticsearch-如何查询特定字段随时间变化的结果
									Lucene
							 
Lucene搜索不会在Orientdb服务器重新启动时返回结果
									Lucene
							 									Orientdb
							 
Hibernate搜索（Lucene）在'='；串
									Lucene
							 
Elasticsearch 弹性搜索中的转义保留词
									Lucene
							 
在lucene.net QueryParser中使用通配符的问题
									Lucene
							 
Lucene 将字段/字符串长度添加到日志存储事件
									Lucene
							 									Kibana
							 
Lucene StandardAnalyzer在写入索引时未转换为小写
									Lucene
							 
使用Lucene或Elasticsearch自定义索引和评分
									Lucene
							 
在Elasticsearch中搜索包含；不是"；关键词
									Lucene
							 									Kibana
							 
                                                        
                                                

                                                
                                                        Tags
                                                        
Msbuild
Compiler Construction
Network Programming
Hybris
Debugging
Automation
Cuda
Eclipse Rcp
Python 3.x
String
Install4j
Sass
Processing
Push Notification
Composer Php
Computer Vision
Variables
Magento
Socket.io
Generics
Character Encoding
Cakephp
Mapbox
Model View Controller
Jupyter Notebook
Atom Editor
Csv
Webview
Ios7
Laravel
Swing
Iphone
Bots
Cocos2d Iphone
Oauth 2.0
Rabbitmq
Google Visualization
Silverlight
Sip
Pytorch
Here Api
Sed
Virtualbox
Android Ndk
Maven 2
Odata
Canvas
Vb6
Prometheus
Tkinter
Replace
Url
Dependencies
Migration
E Commerce
Gtk
Windows 10
Stored Procedures
Flash
Antlr4
Cron
Itext
Apache Pig
Woocommerce
Content Management System
Linux Kernel
Jvm
Xna
Subsonic
Input
Awk
Scikit Learn
Command Line
Windows 8
Firefox Addon
Utf 8
Menu
Haskell
Php
Gstreamer
Ruby On Rails 3.1
Validation
Nestjs
Recursion
Backbone.js
Java 8
Web Scraping
Ssis
Notepad++
Maps
Lua
Swift
Scroll
Random
Asterisk
Modelica
Xpath
Project Management
Playframework 2.0
Iis
Centos
Bison
Google Drive Api
Chart.js
Linq To Sql
Animation
Button
Reflection
Kentico
Continuous Integration
Active Directory
Memory Management
Com
Logic
X86
Nsis
C# 3.0
Netlogo
Google Cloud Dataflow
Acumatica
Sqlalchemy
Routing
Mdx
Syntax
Visual C++
Android Studio
Oop
Jaxb
Google Maps Api 3
Webpack
Apache
Azure Devops
Cygwin
Cookies
File
Editor
Data Structures
Ag Grid
Security
Glassfish
Cluster Computing
Design Patterns
Plsql
Layout
Twilio
Fullcalendar
Google Apps Script
Windbg
Weblogic
Url Rewriting
Pagination
Floating Point
Powerbi
C++11
Intellij Idea
Pycharm
Ftp
Joomla
Charts
Opengl
Rx Java
Ember.js
Ipad
Makefile
Sitecore
Azure
Autocomplete
Google Colaboratory
File Upload
Calendar
Numpy
Seo
Facebook
Jwt
Class
Html
Powershell
Oracle10g
Apache Spark
Serialization
Orm
Installation
Process
Influxdb
Octave
Yaml
Dart
Wso2
Serial Port
Ruby On Rails 4
C#
Vba
Common Lisp
Oauth
Visual Studio 2010
Gps
Android Layout
Google Chrome
Wolfram Mathematica
Select
Gruntjs


                

                        
						
                        
                                
                                        
                                                
                                                        
                                                                Copyright © 2024. All Rights Reserved by  - Fatal编程技术网