Speculative execution

Keyword search

Guided search

Click a term to initiate a search.

warning: Parameter 1 to scholarly_urltopdf() expected to be a reference, value given in /home/pubs/htdocs_pubs/includes/menu.inc on line 350.

Improving mapreduce performance in heterogeneous environments

Fri, 04/16/2010 - 16:33 — kolb

Authors:

Zaharia, Matei; Konwinski, Andy; Joseph, Anthony D.; Katz, Randy; Stoica, Ion

Abstract
MapReduce is emerging as an important programming
model for large-scale data-parallel applications such as
web indexing, data mining, and scientiﬁc simulation.
Hadoop is an open-source implementation of MapRe-
duce enjoying wide adoption and is often used for short
jobs where low response time is critical. Hadoop’s per-
formance is closely tied to its task scheduler, which im-
plicitly assumes that cluster nodes are homogeneous and
tasks make progress linearly, and uses these assumptions
to decide when to speculatively re-execute tasks that ap-

Year:

2008

cs.berkeley.edu
Hadoop
MapReduce
Parallel Data Processing
Speculative execution

Read more
1 attachment
Zaharia2008Improvingmapreduceperformanceinheterogeneousenvironments.pdf

Cloud Computing publication categorizer

Keyword search

Guided search

Author

Year

Topic

Tags

mailpart

Citations range

Speculative execution

Improving mapreduce performance in heterogeneous environments

Navigation

User login