Leading the digitization of Pakistan's legal statutes and codes: modeling a structured schema for legislative documents, setting up infrastructure for high-throughput digitization, and ensuring zero-downtime ingestion.
Developing an ETL pipeline for a legal tech startup that scrapes judgements from multiple courts, summarizes them with an LLM (GPT-3.5), and uploads the results to a DigitalOcean virtual machine.
Led the end-to-end recovery and migration of a mission-critical AlmaLinux production server on UpCloud, resolving high-priority boot errors and optimizing infrastructure with minimal downtime.
Designed and developed a pipeline to extract data from multiple sites, transform it into the OC feed format, and load the entries, using Selenium with ChromeDriver for scraping and the Google Sheets API for loading.