AWS Glue Studio: Perform PySpark SQL Queries Without Knowing Spark
Published in
4 min readFeb 4, 2022
There are a lot more people that know SQL than know how to program in python and are proficient in Spark to perform big data analytics on their data. With AWS Glue Studio, it’s possible to build data pipelines for big data analytics on a distributed cluster without knowing to code a single line of spark code. This tutorial below is a walk-through on how to create a glue job…