ÀÖÓãµç¾º

    ½ÌÓýÐÐÒµA¹ÉIPOµÚÒ»¹É£¨¹ÉƱ´úÂë 003032£©

    È«¹ú×Éѯ/ͶËßÈÈÏߣº400-618-4000

    DStreamÊÇʲô?ÔõÑù¶ÔDStream½øÐвÙ×÷?

    ¸üÐÂʱ¼ä:2021Äê12ÔÂ27ÈÕ11ʱ41·Ö À´Ô´:ÀÖÓãµç¾º ä¯ÀÀ´ÎÊý:

    DStreamµÄ±¾ÖÊ

    DStream(Discretized Stream)ÊÇSpark StreamingÌṩµÄ»ù±¾Êý¾Ý³éÏó¡£Ëü±íʾһ¸öÁ¬ÐøµÄÊý¾ÝÁ÷£¬¿ÉÒÔÊÇ´ÓÔ´½ÓÊÕµ½µÄÊäÈëÊý¾ÝÁ÷£¬Ò²¿ÉÒÔÊÇͨ¹ýת»»ÊäÈëÁ÷Éú³ÉµÄÒÑ´¦ÀíÊý¾ÝÁ÷¡£

    DStreamÓÉһϵÁÐÁ¬ÐøµÄRDD±íʾ£¬Ã¿¸öRDD¶¼°üº¬À´×ÔÌØ¶¨¼ä¸ôµÄÊý¾Ý£¬ÈçÏÂͼËùʾ¡£SparkStreaming¶ÔÁ÷Êý¾Ý°´ÕÕÃë/·ÖµÈʱ¼ä¼ä¸ô½øÐÐ΢Åú»®·Ö£¬Ã¿¸ö΢Åú¾ÍÊÇÒ»¸öRDD£¬ÕâЩ¸öʱ¼äÉÏÁ¬ÐøµÄRDD¾Í×é³ÉÁË

    DStream

    ËùÒÔDStream±¾ÖÊÉϾÍÊÇһϵÁÐʱ¼äÉÏÁ¬ÐøµÄRDD¼´DStream=>Seq[RDD]

    ¶ÔDStream½øÐвÙ×÷

    ¶ÔDStream½øÐвÙ×÷(È磺flatMap/map/filter..)¾ÍÊÇ¶ÔÆäµ×²ãµÄRDD½øÐвÙ×÷

    ¶ÔRDD²Ù×÷»á·µ»ØÐµÄRDD£¬¶ÔDStream½øÐвÙ×÷Ò²»á·µ»ØÐµÄDStream

    DStream¾ßÓÐÈÝ´íÐÔ£º

    RDDÖ®¼ä´æÔÚÒÀÀµ¹ØÏµ£¬DStream¼äÒ²ÓÐÒÀÀµ¹ØÏµ£¬RDD¾ßÓÐÈÝ´íÐÔ£¬ÄÇôDStreamÒ²¾ßÓÐÈÝ´íÐÔ

    ÉÏͼÏà¹ØËµÃ÷£º

    1¡¢Ã¿Ò»¸öÍÖÔ²Ðαíʾһ¸öRDD

    2¡¢ÍÖÔ²ÐÎÖеÄÿ¸öÔ²Ðδú±íÒ»¸öRDDÖеÄÒ»¸öPartition·ÖÇø

    3¡¢Ã¿Ò»ÁеĶà¸öRDD±íʾһ¸öDStream(ͼÖÐÓÐÈýÁÐËùÒÔÓÐÈý¸öDStream

    4¡¢Ã¿Ò»ÐÐ×îºóÒ»¸öRDDÔò±íʾÿһ¸öBatch SizeËù²úÉúµÄÖмä½á¹ûRDD

    DStreamµÄAPI

    ´ó¶àÊýTransformationºÍAction/OutputºÍ֮ǰµÄRDDµÄÒ»ÑùʹÓÃ.ÉÙ²¿·Ö²»Ò»ÑùµÄͨ¹ý°¸Àý½²½â

    DStream Operations

    Transformation
    ´ó¶àÊýºÍRDDÖеÄÀàËÆ£¬µ«ÓÐÒ»Ð©ÌØÊâµÄÕë¶ÔÌØ¶¨ÀàÐÍÓ¦ÓÃʹÓõĺ¯Êý£¬±ÈÈçupdateStateByKey״̬º¯Êý¡¢window´°¿Úº¯ÊýµÈ£¬ºóÐø¾ßÌå½áºÏ°¸Àý½²½â¡£
    http://spark.apache.org/docs/latest/streaming-programming-guide.html#transformations-on-dstreams

    Outputº¯Êý

    Output Operations:½«DStreamÖÐÿÅú´ÎRDD´¦Àí½á¹ûresultRDDÊä³ö
    http://spark.apache.org/docs/latest/streaming-programming-guide.html#output-operations-on-dstreams



    ²ÂÄãϲ»¶£º

    Á½ÖÖRDDµÄÒÀÀµ¹ØÏµ½éÉÜ

    SparkStreamingÁ¬½ÓKafkaÁ½ÖÖ·½Ê½

    SparkÉú̬ϵͳ°üº¬ÄÄЩ×é¼þ£¿

    Spark´¦ÀíÊý¾ÝµÄËٶȱÈHive¸ü¿ì£¿Ô­ÒòÊÇʲô£¿

    ÀÖÓãµç¾ºpython+´óÊý¾Ý¿ª·¢Åàѵ

    0 ·ÖÏíµ½£º
    ºÍÎÒÃÇÔÚÏß½»Ì¸£¡


    ¡¾ÍøÕ¾µØÍ¼¡¿¡¾sitemap¡¿