python

Apache Airflow optimizations

Apache Airflow is the most widely used workflow management platform. I have been building a workflow system using Apache Airflow for more than a year now. The system is required to support workflows that handle large amount of data and scale on-demand. Even though there is help and documentation available online, I ran into few

Apache Airflow optimizations Read More »

Modern Python project structure

I started learning Python programming few years ago because of the overall Eco-system. It has variety of frameworks for almost all major use-cases. Every organization is using Python to implement a software, either in a cloud or on-prem. With the exceptional collaboration in open source communities, there are tools available to code every component of

Modern Python project structure Read More »

Read data from Azure to Pandas

Pandas is very widely used Python library for data manipulation and analysis. It allows to load data in a tabular format to manipulate it for different analysis and graphical presentation. DataFrame is the key data structure in Pandas to perform tabular operations. First step while analyzing data with Pandas is to read a file and

Read data from Azure to Pandas Read More »