Adriano Nicolucci in AWS in Plain English
Upsert Records To An Amazon Redshift Table With Small, Medium and Big Data with Python
Performing simultaneous insert and update operations on a table can be necessary in many scenarios in Redshift. For instance, you may have…
May 11, 2023

Adriano Nicolucci in AWS in Plain English
Most Common Data Architecture Patterns For Data Engineers To Know In AWS
Some architectural examples of how we can process data in AWS depending on our data needs.
Oct 13, 2022

Adriano Nicolucci in AWS in Plain English
Understanding All AWS Glue Import Statements and Why We Need Them
So when you create a brand new AWS Glue job, I don’t know about you but it seems pretty intimidating that there are 6 Python import…
Jul 6, 2022

Adriano Nicolucci in AWS in Plain English
AWS Glue Studio: Perform PySpark SQL Queries Without Knowing Spark
Far more people know SQL than know how to program in Python and are proficient in Spark to perform big data analytics on…
Feb 4, 2022

Adriano Nicolucci in Python in Plain English
Working With AWS Governed Tables in Python
How to create, insert, and query records in your governed table using the AWS Data Wrangler library in Python
Jan 4, 2022

Adriano Nicolucci in Python in Plain English
Top Must-Know Data Wrangling Operations with Python Vaex
When it comes to data analytics and building data pipelines in Python, you have probably heard of pandas, which is versatile for data…
Sep 17, 2021

Adriano Nicolucci
AWS Glue Studio-Overview
The video below provides an overview of AWS Glue Studio which is part of the AWS Glue Service. This is a step-by-step walkthrough of the…
May 16, 2021

Adriano Nicolucci in Python in Plain English
Reading and Writing Data in AWS With Python Just Got A Lot Simpler
Pandas is an extremely popular and essential Python package for data science as it’s powerful, flexible and easy to use open-source data…
Feb 28, 2021

Adriano Nicolucci in Python in Plain English
How To Read Parquet Files In Python Without a Distributed Cluster
Parquet is an open-source columnar storage format created by the Apache Software Foundation. Parquet is growing in popularity as a format…
Dec 13, 2020

Adriano Nicolucci
Top Features In FME 2020.1 Release
On July 9th, 2020, Safe Software Inc. released FME 2020.1. Here are the top features I’m excited about. I have to say this is a pretty…
Jul 19, 2020