¸üÐÂʱ¼ä:2020Äê12ÔÂ22ÈÕ18ʱ05·Ö À´Ô´:ÀÖÓãµç¾º ä¯ÀÀ´ÎÊý:

RDD( Resilient Distributed Dataset£¬µ¯ÐÔ·Ö²¼Ê½Êý¾Ý¼¯)£¬ÊÇÒ»¸öÈÝ´íµÄ¡¢²¢ÐеÄÊý¾Ý½á¹¹£¬¿ÉÒÔÈÃÓû§ÏÔʽµØ½«Êý¾Ý´æ´¢µ½´ÅÅ̺ÍÄÚ´æÖУ¬²¢ÇÒ»¹ÄÜ¿ØÖÆÊý¾ÝµÄ·ÖÇø¡£¶ÔÓÚµü´úʽ¼ÆËãºÍ½»»¥Ê½Êý¾ÝÍÚ¾ò£¬RDD¿ÉÒÔ½«ÖÐ¼ä¼ÆËãµÄÊý¾Ý½á¹û±£´æÔÚÄÚ´æÖУ¬ÈôÊǺóÃæÐèÒªÖмä½á¹û²ÎÓë¼ÆËãʱ£¬Ôò¿ÉÒÔÖ±½Ó´ÓÄÚ´æÖжÁÈ¡£¬´Ó¶ø¿ÉÒÔ¼«´óµØÌá¸ß¼ÆËãËÙ¶È¡£
ÿ¸öRDD¶¼¾ßÓÐÎå´óÌØÕ÷£¬¾ßÌåÈçÏ¡£
1.·ÖÇøÁбí( a list of partitions)
ÿ¸öRDD±»·ÖΪ¶à¸ö·ÖÇø(Partitions)£¬ÕâЩ·ÖÇøÔËÐÐÔÚ¼¯ÈºÖеIJ»Í¬½Úµã£¬Ã¿¸ö·ÖÇø¶¼»á±»Ò»¸ö¼ÆËãÈÎÎñ´¦Àí£¬·ÖÇøÊý¾ö¶¨Á˲¢ÐмÆËãµÄÊýÁ¿£¬´´½¨RDDʱ¿ÉÒÔÖ¸¶¨RDD·ÖÇøµÄ¸öÊý¡£Èç¹û²»Ö¸¶¨·ÖÇøÊýÁ¿£¬µ±RDD´Ó¼¯ºÏ´´½¨Ê±£¬Ä¬ÈÏ·ÖÇøÊýÁ¿Îª¸Ã³ÌÐòËù·ÖÅäµ½µÄ×ÊÔ´µÄCPUºËÊý(ÿ¸öCore¿ÉÒÔ³ÐÔØ2~4¸öPartition)£¬Èç¹ûÊÇ´ÓHDFSÎļþ´´½¨£¬Ä¬ÈÏΪÎļþµÄBlockÊý¡£
2.ÿ¸ö·ÖÇø¶¼ÓÐÒ»¸ö¼ÆË㺯Êý( a function for computing each split)
SparkµÄRDDµÄ¼ÆË㺯ÊýÊÇÒÔ·ÖÆ¬Îª»ù±¾µ¥Î»µÄ£¬Ã¿¸öRDD¶¼»áʵÏÖ computeº¯Êý£¬¶Ô¾ßÌåµÄ·ÖƬ½øÐмÆËã¡£
3.ÒÀÀµÓÚÆäËûRDD(a list of dependencies on other RDDs)
RDDµÄÿ´Îת»»¶¼»áÉú³ÉÒ»¸öеÄRDD£¬ËùÒÔRDDÖ®¼ä¾Í»áÐγÉÀàËÆÓÚÁ÷Ë®ÏßÒ»ÑùµÄǰºóÒÀÀµ¹ØÏµ¡£ÔÚ²¿·Ö·ÖÇøÊý¾Ý¶ªÊ§Ê±£¬Spark¿ÉÒÔͨ¹ýÕâ¸öÒÀÀµ¹ØÏµÖØÐ¼ÆË㶪ʧµÄ·ÖÇøÊý¾Ý£¬¶ø²»ÊǶÔRDDµÄËùÓзÖÇø½øÐÐÖØÐ¼ÆËã¡£
4.(Key£¬Value)Êý¾ÝÀàÐ͵ÄRDD·ÖÇøÆ÷(a Partitioner for Key-Value RDDS)
µ±Ç°SparkÖÐʵÏÖÁËÁ½ÖÖÀàÐ͵ķÖÇøº¯Êý£¬Ò»¸öÊÇ»ùÓÚ¹þÏ£µÄHashPartitioner£¬ÁíÍâ¸öÊÇ»ùÓÚ·¶Î§µÄRangePartitioner¡£Ö»ÓжÔÓÚ(Key£¬Value)µÄRDD£¬²Å»áÓÐPartitioner(·ÖÇø)£¬·Ç(Key£¬Value)µÄRDDµÄPartitionerµÄÖµÊÇNone¡£Partitionerº¯Êý²»µ«¾ö¶¨ÁËRDD±¾ÉíµÄ·ÖÇøÊýÁ¿£¬Ò²¾ö¶¨ÁËparent RDD ShuffleÊä³öʱµÄ·ÖÇøÊýÁ¿¡£
5.ÿ¸ö·ÖÇø¶¼ÓÐÒ»¸öÓÅÏÈλÖÃÁбí(a list of preferred locations to compute each split on)
ÓÅÏÈλÖÃÁбí»á´æ´¢Ã¿¸öPartitionµÄÓÅÏÈλÖ㬶ÔÓÚÒ»¸öHDFSÎļþÀ´Ëµ£¬¾ÍÊÇÿ¸öPartition¿éµÄλÖᣰ´ÕÕ“ÒÆ¶¯Êý¾Ý²»ÈçÒÆ¶¯¼ÆË㔵ÄÀíÄSparkÔÚ½øÐÐÈÎÎñµ÷¶ÈµÄʱºò£¬»á¾¡¿ÉÄܵؽ«¼ÆËãÈÎÎñ·ÖÅäµ½ÆäËùÒª´¦ÀíÊý¾Ý¿éµÄ´æ´¢Î»Öá£
²ÂÄãϲ»¶
RDDת»»Ëã×ÓAPI¹ý³ÌÑÝʾ
²»Í¬ÏµÍ³ÈçºÎ¼ÓÔØÊý¾Ý´´½¨RDD?
RDDΪʲôҪ½øÐÐÊý¾Ý³Ö¾Ã»¯?ËüµÄ²Ù×÷·½·¨ÓÐÄÄЩ?
ÀÖÓãµç¾º´óÊý¾ÝÅàѵ¿Î³Ì
ÔõÑùʹÓÃSpark ShellÀ´¶ÁÈ¡HDFSÎļþ£¿
2020-12-21ScalaµÄ¿ØÖƽṹÓï¾äÓм¸ÖÖ£¿¸÷Óï¾äµÄÓï·¨¸ñʽÊÇʲô£¿
2020-12-17IDEA¹¤¾ß¿ª·¢WordCountµ¥´Ê¼ÆÊý³ÌÐòµÄÏà¹Ø²½ÖèÓÐÄÄЩ£¿
2020-12-17ScalaµÄÉùÃ÷ÖµºÍ±äÁ¿¡¾´óÊý¾ÝÎÄÕ¡¿
2020-12-17MapReduce±à³ÌµÄÁ½ÖÖÊý¾ÝÁ÷Ä£ÐÍÑÝʾ
2020-12-17HBaseÊý¾Ý¿âÊÇÔõÑù´æ´¢Êý¾ÝµÄ£¿
2020-12-17
±±¾©Ð£Çø