Position Overview
About Us
AiLogic Neural Network Pvt Ltd is an AI-driven product company focused on building advanced language technology solutions, including machine translation, document intelligence, and large-scale NLP systems.
We are looking for a highly motivated Data Engineer to join our growing AI team and contribute to the development of scalable data processing pipelines for NLP and LLM applications.
Roles & Responsibilities
Design, develop, and maintain scalable data pipelines for processing large volumes of structured and unstructured data.
Build document ingestion and processing workflows for PDFs, scanned documents, HTML pages, and other text sources.
Implement OCR, PDF parsing, HTML parsing, and text extraction pipelines.
Develop document chunking and preprocessing frameworks for NLP and LLM-based applications.
Work with Hugging Face models and NLP libraries for text processing tasks.
Create and optimize data transformation workflows using Python...