cs.wisc.edu

Keyword search

Guided search

Click a term to initiate a search.

warning: Parameter 1 to scholarly_urltopdf() expected to be a reference, value given in /home/pubs/htdocs_pubs/includes/menu.inc on line 350.

A comparison of approaches to large-scale data analysis

Mon, 10/19/2009 - 09:43 — admin

Authors:

Pavlo, Andrew; Paulson, Erik; Rasin, Alexander; Abadi, Daniel J.; DeWitt, David J.; Madden, Samuel; Stonebraker, Michael

There is currently considerable enthusiasm around the MapReduce
(MR) paradigm for large-scale data analysis [17]. Although the
basic control ﬂow of this framework has existed in parallel SQL
database management systems (DBMS) for over 20 years, some
have called MR a dramatically new computing model [8, 17]. In
this paper, we describe and compare both paradigms. Furthermore,
we evaluate both kinds of systems in terms of performance and de-
velopment complexity. To this end, we deﬁne a benchmark con-
sisting of a collection of tasks that we have run on an open source

Year:

2009

Cloud Computing publication categorizer

Keyword search

Guided search

Author

Year

Topic

Tags

mailpart

Citations range

cs.wisc.edu

A comparison of approaches to large-scale data analysis

Navigation

User login