rdd

February 13, 2021

Generate Sequential and Unique IDs in a Spark Dataframe

Apache Spark is an open source, general-purpose distributed computing engine used for processing and analyzing a large amount of data. Hence, adding sequential and unique IDs to a Spark Dataframe is not very straight forward, because of distributed nature of it.

Generate Sequential and Unique IDs in a Spark Dataframe

Like this:

Recent Posts

Categories

Recent Posts

Categories

Find Us

rdd

Generate Sequential and Unique IDs in a Spark Dataframe

Share this:

Like this:

Recent Posts

Categories

Tags

Recent Posts

Categories

Tags

Find Us