码迷,mamicode.com
Search keyword: getpartitions (2 results)
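7. What exactly is an RDD?
RDD is an abstract class that defines methods such as map() and reduce(), but in practice a derived class that extends RDD generally only needs to implement two methods: def getPartitions: Array[Partition] def compute(thePart: Partition, context: TaskContext): Ne ...
Category: Other good articles   Time: 2016-08-02 23:43:14   Views: 137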
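The division of labour described in this snippet (a base class that carries the generic operations, subclasses that supply only the partition list and the per-partition computation) can be illustrated with a small toy model. The sketch below does not use Spark at all: MiniRDD, MiniPartition, RangePartition and RangeRDD are hypothetical names invented for illustration, the TaskContext parameter is left out, and the Iterator return type is this sketch's own choice rather than a completion of the truncated signature above.

```scala
// Toy model of the contract described above. These are NOT Spark's classes;
// every name here is hypothetical and exists only for illustration.
trait MiniPartition { def index: Int }

abstract class MiniRDD[T] {
  // The two methods a concrete subclass has to supply.
  protected def getPartitions: Array[MiniPartition]
  def compute(part: MiniPartition): Iterator[T]

  // Generic operations are written once, in terms of those two methods.
  final def partitions: Array[MiniPartition] = getPartitions

  def collect(): List[T] = partitions.toList.flatMap(p => compute(p))

  def map[U](f: T => U): MiniRDD[U] = {
    val parent = this
    new MiniRDD[U] {
      // A mapped dataset reuses the parent's partitions and transforms each one lazily.
      protected def getPartitions: Array[MiniPartition] = parent.partitions
      def compute(part: MiniPartition): Iterator[U] = parent.compute(part).map(f)
    }
  }

  def reduce(f: (T, T) => T): T = collect().reduce(f)
}

// A partition covering the half-open integer range [start, end).
final case class RangePartition(index: Int, start: Int, end: Int) extends MiniPartition

// Concrete dataset: the integers 0 until n, split into numSlices partitions.
class RangeRDD(n: Int, numSlices: Int) extends MiniRDD[Int] {
  protected def getPartitions: Array[MiniPartition] =
    Array.tabulate[MiniPartition](numSlices) { i =>
      RangePartition(i, i * n / numSlices, (i + 1) * n / numSlices)
    }

  def compute(part: MiniPartition): Iterator[Int] = part match {
    case RangePartition(_, start, end) => (start until end).iterator
    case other => throw new IllegalArgumentException(s"unexpected partition: $other")
  }
}
```

With these definitions, new RangeRDD(10, 3).map(_ * 2).reduce(_ + _) evaluates to 90, even though RangeRDD itself defines neither map nor reduce.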
Spark Kernel Revealed, Part 10: RDD Source Code Analysis
/** A Resilient Distributed Dataset (RDD), the basic abstraction in Spark. Represents an immutable, partitioned collection of elements that can be operated on in parallel. This class contains the basic operations available on a...
Category: Other good articles   Time: 2015-01-21 16:37:59   Views: 196