77范文网 - 专业文章范例文档资料分享平台

I. Ontology-based Information Retrieval(3)

来源:网络收集 时间:2021-04-06 下载这篇文档 手机版
说明:文章内容仅供预览,部分内容可能不全,需要完整文档或者需要复制内容,请下载word后使用。下载word有问题请添加微信号:或QQ: 处理(尽可能给您提供完整文档),感谢您的支持与谅解。点击这里给我发消息

Abstract: In the proposed article a new, ontology-based approach to information retrieval (IR) is presented. The system is based on a domain knowledge representation schema in form of ontology. New resources registered within the system are linked to conce

where S is the diagonal matrix of singular values and U,Vare matrices of left and right singular vectors. If the singular values in S are ordered by size, the firstklargest values may be kept and the remaining smaller ones are set to zero. The product of the resulting matrices is a matrix approximately equal to A, and is closest to A in the least squares sense.

TA ASVD where ASVD=UKSKVK

In order to determine similarity between a query and approximate document vector Di,SVD, we need to transform query vector to new feature space. (Original query vector is computed with tf-idf scheme as described above for vector model approach.)

T 1QSVD=QTF IDFUKSK

and then we can compute similarity in the same way as before, i.e.

simSVD(QSVD,Di,SVD)=Di,SVD×QSVD

Di,SVDQSVD.

2.3. ONTOLOGY-BASED APPROACH

This part describes the Webocrat-like approach that uses ontology for document retrieval purposes. For the experiments described below we did not consider type of relation in ontology for calculation of similarity between concepts. Moreover, we assumed that the set of relevant concepts to the query is known. But this condition can be achieved with any technique for assigning concepts from ontology to a query, e.g. based on manual assignment or based on synonyms to query terms, making use of Wordnet or other.

The way in which a query is processed by this approach is shown on the Figure 1. For a given query first appropriated concepts are retrieved - in our case manually from the user. Then the set of concepts associated with each document is retrieved from database. As next, these two sets are compared using simple metric, which expresses the similarity between a document Di and given query Q.

Qcon∪Di,conifQcon∪Di,con≠0

simonto(Q,Di)=

k

where Qcon is a set of concepts assigned to query Q and Dcon is a set of concepts assigned to document Di, and k is small constant, e.g. 0.1. Resulted number represents ontology-based similarity measure. Better results have been achieved when this number have been combined with some of the previous two retrieval approaches described above (i.e. LSI approach or vector model). The final similarity is then computed as multiplication, e.g.

sim(Q,Di)=simonto(Q,Di) simTF IDF(Q,Di)

百度搜索“77cn”或“免费范文网”即可找到本站免费阅读全部范文。收藏本站方便下次阅读,免费范文网,提供经典小说教育文库I. Ontology-based Information Retrieval(3)在线全文阅读。

I. Ontology-based Information Retrieval(3).doc 将本文的Word文档下载到电脑,方便复制、编辑、收藏和打印 下载失败或者文档不完整,请联系客服人员解决!
本文链接:https://www.77cn.com.cn/wenku/jiaoyu/1214189.html(转载请注明文章来源)
Copyright © 2008-2022 免费范文网 版权所有
声明 :本网站尊重并保护知识产权,根据《信息网络传播权保护条例》,如果我们转载的作品侵犯了您的权利,请在一个月内通知我们,我们会及时删除。
客服QQ: 邮箱:tiandhx2@hotmail.com
苏ICP备16052595号-18
× 注册会员免费下载(下载后可以自由复制和排版)
注册会员下载
全站内容免费自由复制
注册会员下载
全站内容免费自由复制
注:下载文档有可能“只有目录或者内容不全”等情况,请下载之前注意辨别,如果您已付费且无法下载或内容有问题,请联系我们协助你处理。
微信: QQ: