← Back to Jobs
Actualize | Pune, India | Posted May 30, 2026
Position Overview
Key Responsibilities
- Design, develop, and maintain scalable data architectures for structured and unstructured data including text, images, audio, and video.
- Build and optimize enterprise ETL/ELT pipelines using Python, SQL, Spark/PySpark, and Databricks.
- Integrate and process data from enterprise platforms such as SAP, Oracle, Azure Data Lake, and other cloud/on-prem systems.
- Develop high-performance data pipelines to support AI/ML, computer vision, predictive analytics, and Generative AI use cases.
- Implement large-scale image and video preprocessing workflows for AI-driven applications.
- Work with feature stores, vector databases, embeddings, and LLM-based data workflows.
- Ensure data quality, governance, lineage tracking, metadata management, and security compliance across platforms.
- Collaborate with AI engineers, data scientists, and cross-functional teams to deliver production-ready data solutions.
- Optimiz...