Adriano N

255 Followers


Published in AWS in Plain English

·Oct 13, 2022

Most Common Data Architecture Patterns For Data Engineers To Know In AWS

Some architectural examples of how we can process data in AWS depending on our data needs. There are many ways to process data in AWS with the many services that are available. There is also a growing demand in organizations to update systems closer to real time. In my opinion, data engineers should not silo themselves by only having the skills to build batch processing data…

AWS

8 min read



Published in AWS in Plain English

·Jul 6, 2022

Understanding All AWS Glue Import Statements and Why We Need Them

When you create a brand new AWS Glue job, I don’t know about you, but it seems pretty intimidating that 6 Python import statements are generated automatically. …

AWS

4 min read



Published in AWS in Plain English

·Feb 4, 2022

AWS Glue Studio: Perform PySpark SQL Queries Without Knowing Spark

Far more people know SQL than know how to program in Python and are proficient enough in Spark to perform big data analytics on their data. With AWS Glue Studio, it’s possible to build data pipelines for big data analytics on a distributed cluster without knowing to…

AWS

4 min read



Published in Python in Plain English

·Jan 4, 2022

Working With AWS Governed Tables in Python

How to create, insert, and query records in your governed table using the AWS Data Wrangler library in Python. Governed tables are a new type of table on Amazon S3 that supports ACID transactions, which can help you build more resilient data pipelines for your data lake on AWS. Governed tables can be enabled on top of your data in Amazon S3 in the following formats: Avro, CSV, JSON…

AWS

4 min read



Published in Python in Plain English

·Sep 17, 2021

Top Must-Know Data Wrangling Operations with Python Vaex

When it comes to data analytics and building data pipelines in Python, you have probably heard of pandas, which is versatile for data analytics on a single machine, and PySpark, for data analytics in a distributed environment when you are working with too much data for one machine to handle…

Python

3 min read



May 16, 2021

AWS Glue Studio-Overview

The video below provides an overview of AWS Glue Studio, which is part of the AWS Glue service. This is a step-by-step walkthrough of the various components of AWS Glue Studio and how to use it.

Aws Glue

1 min read



Published in Python in Plain English

·Feb 28, 2021

AWS Data Wrangler Overview

Pandas is an extremely popular and essential Python package for data science, as it’s a powerful, flexible, and easy-to-use open-source tool for data analysis and data manipulation. …

Pandas

6 min read

Reading and Writing Data in AWS With Python Just Got A Lot Simpler


Published in Python in Plain English

·Dec 13, 2020

How To Read Parquet Files In Python Without a Distributed Cluster

Parquet is an open-source columnar storage format maintained by the Apache Software Foundation. Parquet is growing in popularity as a format in the big data world, as it allows for faster query run times, is smaller in size, and requires less data to be scanned compared to formats such…

Parquet

2 min read



Jul 19, 2020

Top Features In FME 2020.1 Release

On July 9th, 2020, Safe Software Inc. released FME 2020.1. Here are the top features I’m excited about. I have to say this is a pretty significant .1 release compared to some .1 releases in previous years. Readers and Writers: FME has always had unrivalled spatial support and they have continued to expand…

Fme Software

3 min read



Nov 3, 2019

A Beginner’s Guide to the Pre-Construction Condo Process In Toronto

So you’re interested in buying a pre-construction condo to live in or as an investment but don’t know where to start? Here’s a guide from start to finish on purchasing a condo unit based on my recent personal experience. If you do not mind waiting a few years to move…

Real Estate

6 min read



I am a Solution Architect Consultant focusing on building Data platforms on AWS.

Following
  • Jose Antonio Ribeiro Neto (Zezinho)
  • Giorgos Myrianthous
  • James Garside
  • Kesi Parker
  • Anne Nasato
