RAG Architect, Engineer
Company: Nutanix
Location: San Diego
Posted on: June 2, 2025
Job Description:
Company:Qualcomm India Private LimitedJob Area:Engineering Group,
Engineering Group > Software Test EngineeringGeneral Summary:Job
descriptionWe are seeking an experienced AI Architect to design,
develop, and deploy Retrieval-Augmented Generation (RAG) solutions
for Qualcomm Cloud AI Platforms.
Roles and Responsibilities
- Lead the design and development of applications for RAG AI
models and provide APIs for frontend consumption. Manage the
interaction between retrieval-augmented techniques and generative
models.
- Build services that connect AI models (e.g., transformers,
embeddings, and vector search) to handle tasks such as query
retrieval, model inference, and generating responses. Leverage
frameworks like Flask, FastAPI, or Django for API development.
- Design pipelines to preprocess, clean, and prepare data for AI
model training, as well as for serving the models in production
environments. Optimize these pipelines to support both batch and
real-time data processing. Implement RESTful APIs or GraphQL
endpoints for seamless frontend-backend interaction.
- Implement cloud solutions to host Python-based services,
ensuring that AI models are scalable and that the infrastructure
can handle high traffic. Leverage containerization (Docker) and
orchestration (Kubernetes) for model deployment and
management.
- Set up monitoring, logging, and alerting for Python backend
services, ensuring smooth operation of AI features. Use tools like
Prometheus, Grafana, and ELK stack for real-time performance
tracking.
- Continuously optimize model performance by fine-tuning and
adapting Python-based AI models for real-time use cases. Manage
trade-offs between computation load, response time, and quality of
generated content.
- Partner with data scientists, machine learning engineers, and
mobile/web developers to ensure tight integration between AI
models, mobile/web front-end, and backend infrastructure.-
Experience:
- 2+ years of overall SW development experience
- 2 years Strong experience in working with technologies (e.g.,
React, React Native, Flutter, Django, Flask, FastAPI).
- 2+ years of experience in building AI applications with a focus
on NLP, machine learning, generative models, and
retrieval-augmented systems.
- Proven experience in designing and deploying AI systems that
integrate retrieval-based techniques (e.g., FAISS, Weaviate) and
generative models (e.g., GPT, BERT).
- Expertise in cloud platforms (e.g., AWS, GCP, Azure) and
deployment of Python-based microservices.
- Building RESTful APIs or GraphQL services (using frameworks
like Flask, FastAPI, or Django).
- Handling AI model inference and data processing (using
libraries like NumPy, Pandas, TensorFlow, PyTorch, and Hugging Face
Transformers).
- Integrating vector search solutions (e.g., FAISS, Pinecone,
Weaviate) with the AI models for efficient retrieval-augmented
generation. - Experience with containerization (Docker) and
Kubernetes for deploying scalable Python-based services.
- Proficient in cloud infrastructure management, with a focus on
managing Python services in the cloud.
- Experience in End-to-End product development and Software
LifecycleKey Skills:
- Advanced proficiency in Python for building backend services
and data processing pipelines. Familiarity with frameworks like
Flask, Django, and FastAPI.
Experience with AI libraries and frameworks (TensorFlow, PyTorch,
Hugging Face Transformers).
- Familiarity with vector databases (e.g., Pinecone, FAISS,
Weaviate) and integration with retrieval-augmented systems.
- Strong knowledge of RESTful API design, GraphQL, and API
security best practices (e.g., OAuth, JWT).
- Excellent problem-solving abilities and a strong focus on
creating highly scalable and performant solutions.
- Strong communication skills, with the ability to collaborate
across different teams and geography
- Ability to mentor junior team members and lead technical
discussions.Minimum Qualifications:--- Bachelor's degree in
Engineering, Information Systems, Computer Science, or related
field.Applicants: Qualcomm is an equal opportunity employer. If you
are an individual with a disability and need an accommodation
during the application/hiring process, rest assured that Qualcomm
is committed to providing an accessible process. You may e-mailor
call Qualcomm's toll-free number found. Upon request, Qualcomm will
provide reasonable accommodations to support individuals with
disabilities to be able participate in the hiring process. Qualcomm
is also committed to making our workplace accessible for
individuals with disabilities. (Keep in mind that this email
address is used to provide reasonable accommodations for
individuals with disabilities. We will not respond here to requests
for updates on applications or resume inquiries).Qualcomm expects
its employees to abide by all applicable policies and procedures,
including but not limited to security and other requirements
regarding protection of Company confidential information and other
confidential and/or proprietary information, to the extent those
requirements are permissible under applicable law.To all Staffing
and Recruiting Agencies:Our Careers Site is only for individuals
seeking a job at Qualcomm. Staffing and recruiting agencies and
individuals being represented by an agency are not authorized to
use this site or to submit profiles, applications or resumes, and
any such submissions will be considered unsolicited. Qualcomm does
not accept unsolicited resumes or applications from agencies.
Please do not forward resumes to our jobs alias, Qualcomm employees
or any other company location. Qualcomm is not responsible for any
fees related to unsolicited resumes/applications.If you would like
more information about this role, please contact .
#J-18808-Ljbffr
Keywords: Nutanix, Laguna Niguel , RAG Architect, Engineer, Professions , San Diego, California
Didn't find what you're looking for? Search again!
Loading more jobs...