ETL / Automation
SecondBite - OC Store ETL
A complete ETL and automation pipeline for an OpenCart e-commerce store. Extracts product data from multiple source stores, normalizes it to a standard feed format, performs match-checking against existing inventory, and loads new entries or updates existing ones based on the latest data. Google Sheets serves as the operational interface for the client.
TypeETL Pipeline / Freelance
Technologies
PythonSeleniumChromeDriverPandasGoogle Sheets APIOpenCart
Pipeline Stages
- Drive ChromeDriver via Selenium to scrape product listings from source stores, handling pagination and dynamic content.
- Normalize extracted data into the OC feed format using Pandas, handling field mapping, type coercion, and missing values.
- Match-check scraped records against existing destination store inventory using stable product identifiers.
- Load new entries or update existing ones in the destination OpenCart store based on the comparison result.
- Sync the final feed state to Google Sheets as the client-facing operational view.