Talha Ijlal

ETL / Automation

SecondBite - OC Store ETL

A complete ETL and automation pipeline for an OpenCart e-commerce store. Extracts product data from multiple source stores, normalizes it to a standard feed format, performs match-checking against existing inventory, and loads new entries or updates existing ones based on the latest data. Google Sheets serves as the operational interface for the client.

Type: ETL Pipeline / Freelance
Technologies: Python, Selenium, ChromeDriver, Pandas, Google Sheets API, OpenCart

Pipeline Stages

  1. Drive ChromeDriver via Selenium to scrape product listings from source stores, handling pagination and dynamic content.
  2. Normalize extracted data into the OC feed format using Pandas, handling field mapping, type coercion, and missing values.
  3. Match-check scraped records against existing destination store inventory using stable product identifiers.
  4. Load new entries or update existing ones in the destination OpenCart store based on the comparison result.
  5. Sync the final feed state to Google Sheets as the client-facing operational view.
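
Stages 2 to 4 can be illustrated with a minimal Pandas sketch. It is not the project's actual code: the column names (`item_code`, `title`, `cost`, `stock`), the `FIELD_MAP` mapping, the use of `sku` as the stable identifier, and the rule that a change in price or quantity triggers an update are all assumptions made for illustration.

```python
import pandas as pd

# Hypothetical mapping from source-store column names to feed columns.
FIELD_MAP = {"item_code": "sku", "title": "name", "cost": "price", "stock": "quantity"}
FEED_COLUMNS = ["sku", "name", "price", "quantity"]


def normalize(raw: pd.DataFrame) -> pd.DataFrame:
    """Map fields, coerce types, and handle missing values."""
    # Rename known columns; reindex adds any missing feed column as NaN.
    feed = raw.rename(columns=FIELD_MAP).reindex(columns=FEED_COLUMNS)
    # Rows without an identifier cannot be matched later, so drop them first.
    feed = feed.dropna(subset=["sku"]).copy()
    feed["sku"] = feed["sku"].astype(str).str.strip()
    feed["name"] = feed["name"].fillna("").astype(str).str.strip()
    # Strip currency symbols and thousands separators before converting.
    price_text = feed["price"].astype(str).str.replace(r"[^0-9.]", "", regex=True)
    feed["price"] = pd.to_numeric(price_text, errors="coerce")
    # Unparseable or missing stock counts are treated as zero (an assumption).
    feed["quantity"] = (
        pd.to_numeric(feed["quantity"], errors="coerce").fillna(0).astype(int)
    )
    # A row without a usable price is not loadable.
    feed = feed.dropna(subset=["price"])
    return feed.drop_duplicates(subset="sku", keep="last").reset_index(drop=True)


def match_check(feed: pd.DataFrame, existing: pd.DataFrame):
    """Split the feed into rows to insert and rows to update."""
    current = existing[["sku", "price", "quantity"]].astype({"sku": str})
    merged = feed.merge(
        current, on="sku", how="left", suffixes=("", "_current"), indicator=True
    )
    # "left_only" marks feed rows with no match in the destination store.
    is_new = merged["_merge"] == "left_only"
    changed = ~is_new & (
        (merged["price"] != merged["price_current"])
        | (merged["quantity"] != merged["quantity_current"])
    )
    to_insert = merged.loc[is_new, FEED_COLUMNS].reset_index(drop=True)
    to_update = merged.loc[changed, FEED_COLUMNS].reset_index(drop=True)
    return to_insert, to_update
```

Calling `normalize` on a scraped frame and passing the result to `match_check` together with the destination inventory yields two frames: one for new products and one for products whose price or quantity differs. Matched rows with no differences appear in neither frame, so repeated runs on unchanged data would produce no writes.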