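I installed everything by following the official documentation, but the final configuration step kept failing. In the end I had to google it again, and only the configuration below got things working. I have only just started playing with coreseek and am not yet familiar with some of the parameters; I wanted to get it configured and try it out, and did not read the official documentation properly.
To avoid compile errors, install the following packages first: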
yum -y install mysql mysql-devel php-mysql qt4-mysql python python-devel gcc-c++ gtk+ libtool automake autoconf glibc-common expat-devel
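Before building, it can help to confirm that the toolchain really is on the PATH. This is only a minimal sketch: the function name check_tools is my own invention and not part of coreseek, and the tool list in the example call is an assumption about what the build needs.

```shell
# Hypothetical helper (not part of coreseek): report any build tool
# that cannot be found on the PATH. Returns non-zero if one is missing.
check_tools() {
    missing=0
    for tool in "$@"; do
        if ! command -v "$tool" >/dev/null 2>&1; then
            echo "missing: $tool"
            missing=1
        fi
    done
    return $missing
}

# Example call, left commented out so the snippet has no side effects:
# check_tools gcc g++ make libtool automake autoconf mysql_config
```

If anything is reported missing, rerun the yum command above before going on.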
1. Installation
wget ####coreseek source archive
wget #####the dictionary (mmseg) used by coreseek
tar zxvf csft-3.1.tar.gz
tar zxvf mmseg-3.1.tar.gz
#####mmseg must be installed before coreseek
cd mmseg-3.1
./configure --prefix=/usr/local/mmseg
make
make install
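######## Install coreseek ########
cd ../csft-3.1 ##assuming the archive unpacked to csft-3.1 next to mmseg-3.1
##The Python data source is not used here; if you need it, add --with-python. The mmseg paths must match where mmseg was installed
./configure --prefix=/usr/local/coreseek --with-mmseg-includes=/usr/local/mmseg/include/mmseg --with-mmseg-libs=/usr/local/mmseg/lib --without-iconv
##Specifying the --enable-id64 option turns on support for 64-bit document IDs and word IDs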
make
make install
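If nothing went wrong, the coreseek directory and its files now exist under /usr/local/.
Next, generate the mmseg dictionary and configuration file:
cd /usr/local/mmseg
/usr/local/mmseg/bin/mmseg -u /usr/local/src/mmseg-3.1/data/unigram.txt ###unigram.txt is the dictionary source file; this generates unigram.txt.uni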
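A quick way to check that the install really landed where expected is to look for the binaries. This is a rough sketch only: check_coreseek is a made-up helper name, and I am assuming the binaries are called indexer, searchd and search as in stock Sphinx.

```shell
# Hypothetical helper: verify that the expected coreseek binaries exist
# and are executable under the given prefix.
# Assumption: binary names match stock Sphinx (indexer, searchd, search).
check_coreseek() {
    prefix=${1:-/usr/local/coreseek}
    status=0
    for bin in indexer searchd search; do
        if [ -x "$prefix/bin/$bin" ]; then
            echo "ok: $prefix/bin/$bin"
        else
            echo "missing: $prefix/bin/$bin"
            status=1
        fi
    done
    return $status
}

# Example call, commented out so the snippet has no side effects:
# check_coreseek /usr/local/coreseek
```

A non-zero return means at least one binary was not found, in which case the make install output is the first place to look.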
cd ../coreseek
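mkdir dict ###create the dictionary directory
cp /usr/local/src/mmseg-3.1/data/unigram.txt.uni dict/uni.lib ###copy the generated dictionary into dict
vim dict/mmseg.ini ####create the mmseg configuration file; the Windows build of coreseek already ships with this file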
mmseg.ini:
[mmseg]
merge_number_and_ascii=1;
number_and_ascii_joint=-;
compress_space=0;
seperate_number_ascii=1;
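If you would rather not type the file by hand in vim, the same content can be written with a here-document. A minimal sketch, assuming the dict directory used in this post; write_mmseg_ini is just a name I picked, and the key seperate_number_ascii is spelled exactly as in the file above.

```shell
# Hypothetical helper: write the mmseg.ini shown above into a dict directory.
# The default path is the one used in this post.
write_mmseg_ini() {
    dict_dir=${1:-/usr/local/coreseek/dict}
    mkdir -p "$dict_dir" || return 1
    cat > "$dict_dir/mmseg.ini" <<'EOF'
[mmseg]
merge_number_and_ascii=1;
number_and_ascii_joint=-;
compress_space=0;
seperate_number_ascii=1;
EOF
}

# Example call, commented out so the snippet has no side effects:
# write_mmseg_ini /usr/local/coreseek/dict
```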
mmseg is now configured. The next step is csft.conf, the coreseek configuration file:
source article
{
type = mysql
sql_host = localhost
sql_user = root
sql_pass = jiaxian
sql_db = test
sql_port = 3306 # optional, default is 3306
sql_query_pre = SET NAMES utf8
#sql_query_pre = SET SESSION query_cache_type=OFF ##this turns off the SQL query cache
#sql_query = SELECT id, classid, checked, title, newstime, newstext FROM article
sql_query_range = SELECT MIN(id),MAX(id) FROM article
sql_range_step = 1000
sql_query = SELECT id, classid, checked, title, newstime, newstext FROM article WHERE id>=$start AND id<=$end
sql_attr_uint = classid
sql_attr_uint = checked
sql_attr_uint = newstime
sql_query_info = select * from article where id=$id
}
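The sql_query_range and sql_range_step settings above make the indexer pull rows in ID windows instead of one huge query, with $start and $end filled in for each window. As far as I understand the Sphinx documentation, the windows are cut roughly as in the sketch below. print_ranges is a made-up helper for illustration only; the real stepping happens inside indexer.

```shell
# Illustration only: print the $start/$end pairs that a ranged query
# would step through, given MIN(id), MAX(id) and sql_range_step.
print_ranges() {
    min=$1
    max=$2
    step=$3
    start=$min
    while [ "$start" -le "$max" ]; do
        end=$((start + step - 1))
        # The last window is clipped to MAX(id).
        if [ "$end" -gt "$max" ]; then
            end=$max
        fi
        echo "$start $end"
        start=$((end + 1))
    done
}

# Example call, commented out so the snippet has no side effects:
# print_ranges 1 2345 1000
```

With IDs from 1 to 2345 and a step of 1000, this prints three windows: 1 1000, 1001 2000 and 2001 2345, so sql_query would run three times.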
index article
{
source = article
path = /usr/local/coreseek/var/data/article
docinfo = extern
charset_type = zh_cn.utf-8 ###sets the encoding used by coreseek
charset_dictpath = /usr/local/coreseek/dict #####location of the coreseek dictionary files