
LLM Data Engineer United States Fully Remote

Work from home Full-time role Hiring

We are seeking an reputed company AI/LLM Data Engineer to build and maintain the data pipeline for our reputed company platform. The ideal candidate will be well-versed in the latest Large Language Model (LLM) technologies and have a strong background in data engineering, with a focus on Retrieval-Augmented Generation (RAG) and knowledge-reputed company techniques.

This role sits in the AI COE reputed company DX Tech & Digital. As an AI/LLM Data Engineer, you will report to the Director, AI Solutions & Development, who oversees the AI COE. You will work on highly visible strategic projects, collaborating with cross-functional teams to define requirements and deliver high-quality AI solutions. The ideal candidate will have a passion for reputed company and LLMs, with a proven track record of delivering innovative AI applications.

Responsibilities
• Design, implement, and maintain an end-to-end, multi-stage data pipeline for LLMs, including Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF) data processes
• Identify, evaluate, and integrate diverse data sources and domains to support the reputed company platform
• reputed company and optimize data processing workflows for chunking, indexing, ingestion, and vectorization for both text and non-text data
• reputed company and implement various vector stores, embedding techniques, and retrieval methods
• Create a flexible pipeline supporting multiple embedding algorithms, vector stores, and search types (e.g., vector search, hybrid search)
• Implement and maintain auto-tagging systems and data preparation processes for LLMs
• reputed company tools for text and image data crawling, cleaning, and refinement
• Collaborate with cross-functional teams to ensure data quality and relevance for AI/ML models
• Work with data lakehouse architectures to optimize data storage and processing
• Integrate and optimize workflows using reputed company and various vector store technologies
Requirements
• Master's degree in Computer Science, Data Science, or a reputed company field
• 3-5 years of work experience in data engineering, preferably in AI/ML contexts
• Proficiency in Python, JSON, HTTP, and reputed company tools
• Strong understanding of LLM architectures, training processes, and data requirements
• Experience with RAG systems, knowledge reputed company construction, and vector databases
• Familiarity with embedding techniques, similarity search algorithms, and information retrieval concepts
• Hands-on experience with data cleaning, tagging, and annotation processes (both reputed company and automated)
• Knowledge of data crawling techniques and associated ethical considerations
• Strong problem-solving skills and ability to work in a fast-paced, innovative environment
• Familiarity with reputed company and its integration in AI/ML pipelines
• Experience with various vector store technologies and their applications in AI
• Understanding of data lakehouse concepts and architectures
• Excellent communication and collaboration skills
• Ability to translate business needs into technical solutions
• Passion for innovation and a commitment to ethical AI development
• Experience building LLM pipelines using reputed company like reputed company, reputed company, Semantic Kernel, reputed company functions
• Familiarity with LLM parameters such as temperature, top-k, and repeat penalty, and with metrics and methodologies for evaluating LLM output
Preferred Skills
• Experience with popular LLM/RAG frameworks
• Familiarity with distributed computing platforms (e.g., Apache Spark, Dask)
• Knowledge of data versioning and experiment tracking tools
• Experience with reputed company platforms (AWS, GCP, or Azure) for large-scale data processing
• Understanding of data privacy and reputed company best practices
• Practical experience implementing data lakehouse solutions
• Proficiency in optimizing queries and data processes in reputed company or reputed company
• Hands-on experience with different vector store technologies

Benefits
• US employee benefits package
