Skip to content
Change the repository type filter

All

    Repositories list

    • Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
      Java
      Apache License 2.0
      2612895058Updated Apr 20, 2026Apr 20, 2026
    • BigQuery connector for Apache Flink
      Java
      Apache License 2.0
      2638115Updated Apr 15, 2026Apr 15, 2026
    • TypeScript
      Apache License 2.0
      189739Updated Apr 14, 2026Apr 14, 2026
    • BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
      Java
      Apache License 2.0
      2264214011Updated Apr 12, 2026Apr 12, 2026
    • Python
      Apache License 2.0
      40010Updated Apr 8, 2026Apr 8, 2026
    • Cloud Spanner Connector for Apache Spark
      Java
      Apache License 2.0
      221825Updated Apr 8, 2026Apr 8, 2026
    • Python
      Apache License 2.0
      14718Updated Apr 7, 2026Apr 7, 2026
    • Tools for creating Dataproc custom images
      Python
      Apache License 2.0
      673565Updated Mar 26, 2026Mar 26, 2026
    • Scala
      Apache License 2.0
      176137Updated Mar 17, 2026Mar 17, 2026
    • Run in all nodes of your cluster before the cluster starts - lets you customize your cluster
      Shell
      Apache License 2.0
      5165987949Updated Mar 17, 2026Mar 17, 2026
    • Cloud Dataproc: Samples and Utils
      Jupyter Notebook
      Apache License 2.0
      13220529Updated Mar 17, 2026Mar 17, 2026
    • A library enabling BigQuery as Hive storage handler
      Java
      Apache License 2.0
      18101412Updated Mar 11, 2026Mar 11, 2026
    • Library to simplify running distributed ML workloads with Apache Spark
      Python
      Apache License 2.0
      2710Updated Jan 28, 2026Jan 28, 2026
    • Hive Storage Handler for interoperability between BigQuery and Apache Hive
      Java
      Apache License 2.0
      101965Updated Jan 29, 2025Jan 29, 2025
    • .allstar

      Public archive
      2100Updated Dec 6, 2022Dec 6, 2022
    • .github

      Public archive
      0100Updated Oct 26, 2022Oct 26, 2022
    • Python
      Apache License 2.0
      91452Updated May 27, 2022May 27, 2022
    • dataprocmagic

      Public archive
      Python
      Apache License 2.0
      4301Updated Sep 11, 2020Sep 11, 2020
    • Java
      Apache License 2.0
      7370Updated Sep 10, 2020Sep 10, 2020
    • bdutil

      Public archive
      [DEPRECATED] Script used to manage Hadoop and Spark instances on Google Compute Engine
      Shell
      Apache License 2.0
      90109266Updated Nov 15, 2019Nov 15, 2019
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.