sc.parallelize([(1, 2), (1, 3), (2, 3), (2, 4), (3, 1)]).reduceByKey(lambda x, y : x + y).count().collect()操作中会产生多少个 stage()
1
2
3
4