About Bloom
Here, we use the explode function in select to transform a Dataset of lines into a Dataset of words, then combine groupBy and count to compute the per-word counts in the file as a DataFrame of two columns: "word" and "count". To collect the word counts in our shell, we can call collect.
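The pipeline described above can be sketched as follows. This is a minimal illustration, not the original example: the function names `word_counts_local` and `word_counts_spark` and the `path` parameter are hypothetical, and the Spark version assumes that `pyspark` is installed and that a `SparkSession` is passed in. The plain-Python function mirrors the same two steps (explode into words, then group and count) so the logic can be checked without a cluster; it only roughly matches Spark's whitespace handling.

```python
from collections import Counter


def word_counts_local(lines):
    """Plain-Python analogue: split each line into words, then count each word."""
    # One entry per word, similar in spirit to explode(split(...)) in Spark.
    words = [w for line in lines for w in line.split()]
    # Group identical words and count them, like groupBy("word").count().
    return sorted(Counter(words).items())


def word_counts_spark(spark, path):
    """PySpark version (assumes pyspark is installed and `spark` is a SparkSession)."""
    # Imported lazily so this sketch still loads on machines without Spark.
    from pyspark.sql import functions as sf

    lines = spark.read.text(path)  # DataFrame with one string column named "value"
    words = lines.select(sf.explode(sf.split(lines.value, r"\s+")).alias("word"))
    # collect() brings the (word, count) rows back to the driver as a list of Row objects.
    return words.groupBy("word").count().collect()


print(word_counts_local(["to be or", "not to be"]))
# prints [('be', 2), ('not', 1), ('or', 1), ('to', 2)]
```

In a shell session, `word_counts_spark(spark, "README.md")` would return a list of rows with `word` and `count` fields; the file name here is only a placeholder.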