Skip to main content

DataProc

Example

Run on Google cloud's dataproc:

gcloud dataproc batches submit \
--project your-project \
--region us-central1 \
spark \
--version 1.2 \
--subnet default \
--class com.pany.spark.SomeSparkJob \
--jars gs://your-bucket/definity-spark-agent-X-X.jar \
--properties spark.plugins=ai.definity.spark.plugin.DefinitySparkPlugin,spark.definity.server=https://app.definity.run,spark.definity.api.token=$DEFINITY_API_TOKEN,spark.definity.env.name=demo,spark.definity.pipeline.name=example_pipeline

Compatibility matrix

Dataproc ImageSpark VersionScala VersionDefinity Agent
2.33.5.32.12.183.5_2.12-latest
2.23.5.32.12.183.5_2.12-latest
2.13.3.22.12.183.3_2.12-latest
2.03.1.32.12.143.1_2.12-latest
1.52.4.82.12.102.4_2.12-latest
1.42.4.82.11.122.4_2.11-latest
1.32.3.42.11.82.3_2.11-latest