24小时热门版块排行榜    

查看: 273  |  回复: 1
当前主题已经存档。

mxxmc

铁虫 (正式写手)

[交流] 搜索引擎术语

搜索引擎术语(转)
-----------已搜索过了,无重贴。

研究搜索引擎必须对相关术语、概念有确切的理解,词汇手册是必不可少的工具,国内这方面的翻译、整理工作丞待加强。近来因学习需要,常查检有关索引擎术语的英文注释,自觉收益非浅。查找和收集英文的有关搜索引擎的词汇,要用GOOGLE等英文搜索引擎,关键词“Search Engine Glossary”。如:

Boolean search: A search allowing the inclusion or exclusion of documents containing certain words through the use of operators such as AND, NOT and OR.

Concept search: A search for documents related conceptually to a word, rather than specifically containing the word itself.

Full-text index: An index containing every word of every document cataloged, including stop words (defined below).

Fuzzy search: A search that will find matches even when words are only partially spelled or misspelled.

Index: The searchable catalog of documents created by search engine software. Also called "catalog." Index is often used as a synonym for search engine. Index is commonly pluralized as "indices." However, Search Engine Watch instead uses the alternative plural form "indexes."

Keyword search: A search for documents containing one or more words that are specified by a user.

Phrase search: A search for documents containing a exact sentence or phrase specified by a user.

Precision: The degree in which a search engine lists documents matching a query. The more matching documents that are listed, the higher the precision. For example, if a search engine lists 80 documents found to match a query but only 20 of them contain the search words, then the precision would be 25%.

Proximity search: A search where users to specify that documents returned should have the words near each other.

Query-By-Example: A search where a user instructs an engine to find more documents that are similar to a particular document. Also called "find similar."

Recall: Related to precision, this is the degree in which a search engine returns all the matching documents in a collection. There may be 100 matching documents, but a search engine may only find 80 of them. It would then list these 80 and have a recall of 80%.

Relevancy: How well a document provides the information a user is looking for, as measured by the user.

Search Engine: The software that searches an index and returns matches. Search engine is often used synonymously with spider and index, although these are separate components that work with the engine.

Spider: The software that scans documents and adds them to an index by following links. Spider is often used as a synonym for search engine.

Stemming: The ability for a search to include the "stem" of words. For example, stemming allows a user to enter "mming" and get back results also for the stem word "m."

Stop words: Conjunctions, prepositions and articles and other words such as AND, TO and A that appear often in documents yet alone may contain little meaning.

Thesaurus: A list of synonyms a search engine can use to find matches for particular words if the words themselves don't appear in documents.

其释义要比汉语准确和容易理解。有兴趣和语言基础的朋友不妨体会一下从英语学习搜索引擎乐趣
回复此楼
Nevertoooldtolearn.
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖

littlehuaniu

新虫 (小有名气)

1

好,对新手有用
2楼2005-02-26 13:42:12
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖
相关版块跳转 我要订阅楼主 mxxmc 的主题更新
普通表情 高级回复 (可上传附件)
信息提示
请填处理意见