data-flow

Here are 169 public repositories matching this topic...

frankframework

The Data Pulse pipeline processes and transforms web-scraped page views using Apache Beam and Google Cloud Dataflow. It reads JSON lines, parses them into PageView objects, keeps only those with the "product" post type, enriches them with country information, and writes the results to Google BigQuery. Robust logging and error handling ensure data integrity.

  • Updated Jul 2, 2024
  • Java
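
The stages named in the description above (read JSON lines, parse, filter, enrich, write) can be sketched in plain Java to show the shape of the data flow. This is only an illustration under stated assumptions: it does not use Apache Beam, Dataflow, or BigQuery, and it is not taken from the repository. The field names `url`, `post_type`, and `country_code`, the `EnrichedPageView` record, the lookup table, and the regex-based parsing are all hypothetical stand-ins.

```java
import java.util.List;
import java.util.Map;
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;
import java.util.stream.Collectors;

public class PageViewPipelineSketch {

    // Hypothetical shape; the real PageView fields are not given in the description.
    record PageView(String url, String postType, String countryCode) {}

    // Hypothetical output row standing in for a BigQuery table row.
    record EnrichedPageView(String url, String postType, String countryName) {}

    // Illustrative lookup table; the real enrichment source is unknown.
    static final Map<String, String> COUNTRY_NAMES =
            Map.of("US", "United States", "DE", "Germany");

    // Extracts a flat string field such as "url":"..." from one JSON line.
    // A real pipeline would use a proper JSON parser instead of a regex.
    static Optional<String> field(String json, String name) {
        Pattern p = Pattern.compile("\"" + Pattern.quote(name) + "\"\\s*:\\s*\"([^\"]*)\"");
        Matcher m = p.matcher(json);
        return m.find() ? Optional.of(m.group(1)) : Optional.empty();
    }

    // Parse step: malformed lines are logged and dropped rather than failing the run.
    static Optional<PageView> parse(String line) {
        Optional<String> url = field(line, "url");
        Optional<String> postType = field(line, "post_type");
        Optional<String> country = field(line, "country_code");
        if (url.isEmpty() || postType.isEmpty() || country.isEmpty()) {
            System.err.println("Skipping malformed line: " + line);
            return Optional.empty();
        }
        return Optional.of(new PageView(url.get(), postType.get(), country.get()));
    }

    // Parse -> filter "product" -> enrich with country name -> collect
    // (collecting to a list stands in for the write step).
    static List<EnrichedPageView> run(List<String> jsonLines) {
        return jsonLines.stream()
                .map(PageViewPipelineSketch::parse)
                .flatMap(Optional::stream)
                .filter(pv -> "product".equals(pv.postType()))
                .map(pv -> new EnrichedPageView(
                        pv.url(),
                        pv.postType(),
                        COUNTRY_NAMES.getOrDefault(pv.countryCode(), "Unknown")))
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        List<String> lines = List.of(
                "{\"url\":\"/shoes\",\"post_type\":\"product\",\"country_code\":\"US\"}",
                "{\"url\":\"/news\",\"post_type\":\"blog\",\"country_code\":\"DE\"}",
                "not json at all");
        run(lines).forEach(System.out::println);
    }
}
```

In a Beam pipeline each of these steps would typically be its own transform over a distributed collection rather than a stream operation over an in-memory list, but the order of the stages is the same.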
