AWS Glue Studio: Perform PySpark SQL Queries Without Knowing Spark

Adriano Nicolucci
AWS in Plain English
4 min read · Feb 4, 2022


Far more people know SQL than know how to program in Python and use Spark for big data analytics. With AWS Glue Studio, it’s possible to build data pipelines for big data analytics on a distributed cluster without writing a single line of Spark code. This tutorial is a walk-through on how to create a Glue job…


I am a Solution Architect Consultant focused on building data platforms on AWS.