Data locality

Keyword search

Guided search

Click a term to initiate a search.

CoHadoop: flexible data placement and its exploitation in Hadoop

Wed, 08/17/2011 - 17:40 — kolb

Authors:

Eltabakh, MY; Tian, Y; Özcan, F; Gemulla, R; Krettek, A; McPherson, J

Hadoop has become an attractive platform for large-scale data ana-
lytics. In this paper, we identify a major performance bottleneck of
Hadoop: its lack of ability to colocate related data on the same set
of nodes. To overcome this bottleneck, we introduce CoHadoop,
a lightweight extension of Hadoop that allows applications to con-
trol where data are stored. In contrast to previous approaches, Co-
Hadoop retains the ﬂexibility of Hadoop in that it does not require
users to convert their data to a certain format (e.g., a relational

Year:

2011

Cloud Infrastructure
Colocation
Data locality
Hadoop
MapReduce
Parallel Data Processing

Read more
1 attachment
p575-eltabakh.pdf

Cloud Computing publication categorizer

Keyword search

Guided search

Author

Year

Topic

Tags

mailpart

Citations range

Data locality

CoHadoop: flexible data placement and its exploitation in Hadoop

Navigation

User login