码迷,mamicode.com
首页 > Web开发 > 详细

Solr4.8.0源码分析(13)之LuceneCore的索引修复

时间:2014-10-23 01:21:15      阅读:231      评论:0      收藏:0      [点我收藏+]

标签:Lucene   style   blog   color   io   os   ar   使用   java   

Solr4.8.0源码分析(13)之LuceneCore的索引修复

题记:今天在公司研究elasticsearch,突然看到一篇博客说elasticsearch具有索引修复功能,顿感好奇,于是点进去看了下,发现原来是Lucene Core自带的功能,于是就回家先学习下,正好也跟之前看的索引文件的格式相应。有空也研究下Lucene的一些小工具。

 

索引的修复主要是用到CheckIndex.java这个类,可以直接查看类的Main函数来了解下。

1. CheckIndex的使用

首先使用以下命令来查看lucenecore.jar怎么使用:

 1 192:lib rcf$ java -cp lucene-core-4.8-SNAPSHOT.jar -ea:org.apache.lucene... org.apache.lucene.index.CheckIndex 
 2 
 3 ERROR: index path not specified
 4 
 5 Usage: java org.apache.lucene.index.CheckIndex pathToIndex [-fix] [-crossCheckTermVectors] [-segment X] [-segment Y] [-dir-impl X]
 6 
 7   -fix: actually write a new segments_N file, removing any problematic segments
 8   -crossCheckTermVectors: verifies that term vectors match postings; THIS IS VERY SLOW!
 9   -codec X: when fixing, codec to write the new segments_N file with
10   -verbose: print additional details
11   -segment X: only check the specified segments.  This can be specified multiple
12               times, to check more than one segment, eg -segment _2 -segment _a.
13               You cant use this with the -fix option
14   -dir-impl X: use a specific FSDirectory implementation. If no package is specified the org.apache.lucene.store package will be used.
15 
16 **WARNING**: -fix should only be used on an emergency basis as it will cause
17 documents (perhaps many) to be permanently removed from the index.  Always make
18 a backup copy of your index before running this!  Do not run this tool on an index
19 that is actively being written to.  You have been warned!
20 
21 Run without -fix, this tool will open the index, report version information
22 and report any exceptions it hits and what action it would take if -fix were
23 specified.  With -fix, this tool will remove any segments that have issues and
24 write a new segments_N file.  This means all documents contained in the affected
25 segments will be removed.
26 
27 This tool exits with exit code 1 if the index cannot be opened or has any
28 corruption, else 0.

当敲java -cp lucene-core-4.8-SNAPSHOT.jar -ea:org.apache.lucene... org.apache.lucene.index.CheckIndex 这个就能看到相当于help的信息啦,但是为什么这里用这么一串奇怪的命令呢?

今天太晚了,明天继续学习

Solr4.8.0源码分析(13)之LuceneCore的索引修复

标签:Lucene   style   blog   color   io   os   ar   使用   java   

原文地址:http://www.cnblogs.com/rcfeng/p/4044763.html

(0)
(0)
   
举报
评论 一句话评论(0
登录后才能评论!
© 2014 mamicode.com 版权所有  联系我们:gaon5@hotmail.com
迷上了代码!