24小时热门版块排行榜    

CyRhmU.jpeg
查看: 929  |  回复: 9
【奖励】 本帖被评价1次,作者yj222增加金币 0.5
当前主题已经存档。

yj222

木虫 (正式写手)


[资源] 【转贴】 GOOGLE强大的搜索功能揭密【已搜无重复】

PDF格式:
以下是该文中所介绍核心算法的摘要:
ABSTRACT
MapReduce is a programming model and an associated
implementation for processing and generating large
data sets. Users specify a map function that processes a
key/value pair to generate a set of intermediate key/value
pairs, and a reduce function that merges all intermediate
values associated with the same intermediate key. Many
real world tasks are expressible in this model, as shown
in the paper.
Programs written in this functional style are automatically
parallelized and executed on a large cluster of commodity
machines. The run-time system takes care of the
details of partitioning the input data, scheduling the program's
execution across a set of machines, handling machine
failures, and managing the required inter-machine
communication. This allows programmers without any
experience with parallel and distributed systems to easily
utilize the resources of a large distributed system.
Our implementation of MapReduce runs on a large
cluster of commodity machines and is highly scalable:
a typical MapReduce computation processes many terabytes
of data on thousands of machines. Programmers
nd the system easy to use: hundreds of MapReduce programs
have been implemented and upwards of one thousand
MapReduce jobs are executed on Google's clusters
every day.

[ Last edited by 幻影无痕 on 2007-8-9 at 14:02 ]
回复此楼
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖

fightingfor

银虫 (小有名气)


看不懂  呵呵
2楼2007-05-17 21:09:13
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖

xb202

银虫 (小有名气)


看不懂  呵呵
3楼2007-08-10 10:35:40
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖

flashc

金虫 (小有名气)


★★★ 三星级,支持鼓励

先下了,以后有时间再看
4楼2007-10-14 14:40:43
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖
相关版块跳转 我要订阅楼主 yj222 的主题更新
☆ 无星级 ★ 一星级 ★★★ 三星级 ★★★★★ 五星级
普通表情 高级回复(可上传附件)
信息提示
请填处理意见