A data pipeline that orchestrates the ingestion of CNPJs Information from BrasilAPI directly to a Bucket on Google Cloud.
End-to-end automated pipeline: From BrasilAPI to GCS and BigQuery
Python-based engine designed to consume the BrasilAPI, handling specific registration data for 55 CNPJs.
Implementation of data engineering best practices to transform unstructured data into analytics-ready assets.
Automated loading of processed files into Google Cloud Storage with a focus on governance.
year/month/day to optimize scan costs.Deployment of a CI/CD pipeline that turns the local project into an autonomous serverless engine.
Finalizing the pipeline with data health monitoring and business-ready dashboards.