librelist archives

« back to archive

Sobre mahout y spark

Sobre mahout y spark

From:
Matias Eduardo Bordone Carranza
Date:
2014-11-18 @ 22:15
Hablando del uso de hadoop Vs. Spak encontré esto en la página de mahout

25 April 2014 - Goodbye MapReduce

The Mahout community decided to move its codebase onto modern data
processing systems that offer a richer programming model and more efficient
execution than Hadoop MapReduce. *Mahout will therefore reject new
MapReduce algorithm implementations from now on*. We will however keep our
widely used MapReduce algorithms in the codebase and maintain them.

We are building our future implementations on top of a DSL for linear
algebraic operations
<https://mahout.apache.org/users/sparkbindings/home.html> which has been
developed over the last months. Programs written in this DSL are
automatically optimized and executed in parallel on Apache Spark
<http://spark.apache.org/>.

Furthermore, there is an experimental contribution undergoing which
aims to integrate
the h20 platform <https://issues.apache.org/jira/browse/MAHOUT-1500> into
Mahout.

-- 
    "Si tú tienes una manzana y yo tengo una manzana e intercambiamos
    las manzanas, entonces tanto tú como yo seguiremos teniendo una
    manzana. Pero si tú tienes una idea y yo tengo una idea e
    intercambiamos ideas, entonces ambos tendremos dos ideas."
    George Bernard Shaw

Re: [aprendizajengrande] Sobre mahout y spark

From:
Pablo Duboue
Date:
2014-11-21 @ 22:40
Si, eso lo charlamos en las primeras clases de la materia.

On 11/18/14, Matias Eduardo Bordone Carranza <mebordone@gmail.com> wrote:
> Hablando del uso de hadoop Vs. Spak encontré esto en la página de mahout
>
> 25 April 2014 - Goodbye MapReduce
>
> The Mahout community decided to move its codebase onto modern data
> processing systems that offer a richer programming model and more efficient
> execution than Hadoop MapReduce. *Mahout will therefore reject new
> MapReduce algorithm implementations from now on*. We will however keep our
> widely used MapReduce algorithms in the codebase and maintain them.
>
> We are building our future implementations on top of a DSL for linear
> algebraic operations
> <https://mahout.apache.org/users/sparkbindings/home.html> which has been
> developed over the last months. Programs written in this DSL are
> automatically optimized and executed in parallel on Apache Spark
> <http://spark.apache.org/>.
>
> Furthermore, there is an experimental contribution undergoing which
> aims to integrate
> the h20 platform <https://issues.apache.org/jira/browse/MAHOUT-1500> into
> Mahout.
>
> --
>     "Si tú tienes una manzana y yo tengo una manzana e intercambiamos
>     las manzanas, entonces tanto tú como yo seguiremos teniendo una
>     manzana. Pero si tú tienes una idea y yo tengo una idea e
>     intercambiamos ideas, entonces ambos tendremos dos ideas."
>     George Bernard Shaw
>