Note: The job is a remote job and is open to candidates in USA. Cayuse Holdings is looking for a Senior AI Software Engineer to advance enterprise AI initiatives by transforming proof-of-concept solutions into scalable web applications. The role involves developing production-grade AI/ML services, collaborating with cross-functional teams, and ensuring compliance with enterprise standards.
Responsibilities
- Design, develop, and maintain production‑grade AI/ML services and web applications that extend existing POC solutions into scalable, secure, and reliable enterprise platforms
- Implement and optimize AI/ML workflows for: Model ingestion and lifecycle management, Automated quantity extraction from plans and documents, Plan conformance and rules‑based checks, Computer vision–based asset detection and inspection, NLP/LLM‑based plan review automation and document analysis
- Build secure, user‑friendly web interfaces and APIs that enable engineering and business users to leverage AI capabilities within their day‑to‑day workflows
- Architect, implement, and manage CI/CD pipelines to support rapid, reliable deployment of AI/ML models and related services
- Deploy and manage AI/ML workloads across one or more major cloud platforms (AWS, Azure, GCP, OCI), leveraging native AI/ML services as appropriate
- Implement MLOps best practices, including experiment tracking, model registry, feature stores, monitoring, and automated retraining where appropriate
- Optimize model performance and cost through techniques such as quantization, pruning, distillation, and efficient distributed training
- Integrate and operationalize LLM and NLP solutions (e.g., transformers, RAG systems) to support text understanding, summarization, Q&A, and other intelligent automation use cases
- Collaborate with data engineers, cloud engineers, and domain experts to design robust data pipelines and architectures for AI/ML workloads, including time‑series, image/video, and text data
- Ensure that all solutions adhere to security, compliance, and governance standards, especially when working with sensitive or regulated data
- Provide technical leadership, mentorship, and guidance to junior engineers and peers, promoting best practices in AI/ML engineering, DevOps, and software craftsmanship
- Produce high‑quality technical documentation, including architecture diagrams, API specifications, deployment runbooks, and user guides
- Participate in technical planning, backlog grooming, and estimation; contribute to roadmap development for AI/ML capabilities
Skills
- 8+ years of professional software engineering experience, with substantial work in AI/ML and cloud‑native development
- Experience with at least one major cloud platform (AWS, Azure, GCP, or OCI) for deploying and managing ML workloads
- Hands‑on experience with cloud AI/ML services such as Azure AI, AWS SageMaker/Bedrock, GCP Vertex AI, or OCI AI Services
- Strong DevOps background, including: Ansible for configuration management and automation, Docker for containerization, Kubernetes for container orchestration, CI/CD best practices for automated build, test, and deployment
- Proficiency with relational and non‑relational databases, including: SQL (PostgreSQL, MySQL), NoSQL and vector databases for similarity search and embedding‑based retrieval
- Strong scripting skills in both: Bash, PowerShell
- Proven experience designing and maintaining CI/CD pipelines using: Azure DevOps, GitHub Actions, Jenkins, or similar automation tools
- 3–5+ years of production‑level Python development (primary implementation language)
- 3+ years of experience with NLP and LLMs, including: Transformer models (BERT, GPT, T5, etc.), RAG (Retrieval‑Augmented Generation) systems, Fine‑tuning and prompt engineering, Building LLM‑based applications
- 3+ years of experience with time‑series data, including: Forecasting models, Anomaly detection, Sequential data modeling, Real‑time monitoring systems
- 3+ years of experience building recommender systems, such as: Collaborative filtering, Ranking models, Personalization engines, Content recommendation pipelines
- Production experience with MLOps tools and platforms, such as: MLflow, Weights & Biases, Kubeflow, Airflow, or similar systems for orchestration, tracking, and model lifecycle management
- Experience with distributed training, including: Large‑scale model training, Multi‑GPU and/or multi‑node setups, Data/model parallelism and performance optimization
- Production computer vision experience using: PyTorch and/or TensorFlow, OpenCV, YOLO or similar frameworks for object detection and segmentation, Real‑time inference and deployment workflows
- Experience with feature stores (e.g., Feast, Tecton) and/or advanced feature engineering techniques
- Hands‑on experience with model optimization techniques: Quantization, Pruning, Knowledge distillation
- Experience working with LLM ecosystems such as: Ollama, Hugging Face, Other non‑frontier / open‑weight models
- Demonstrated AI/ML production track record: Built and deployed at least 2–3+ ML models serving real users (beyond experimental or research‑only projects)
- Must be able to pass a background check. May require additional background checks as required by projects and/or clients at any time during employment
- Exceptional interpersonal skills with the ability to communicate in a clear, professional, and articulate manner
- Exceptional verbal and written communication skills
- Excellent organizational, analytical, and problem-solving skills with high-level attention to detail
- Proven ability to multitask and prioritize in a fast past environment with changing priorities; adaptable to change and a quick learner
- Must be self-motivated and able to work well independently as well as on a multi-functional team
- Ability to handle sensitive and confidential information appropriately
- Proficient in MS Office, Word, Outlook, PowerPoint, and Excel
- 1+ year of experience with Geospatial Information Systems (GIS) and analyzing or modeling spatial data
- Prior experience in one or more of the following domains: Transportation, Logistics, Smart city or urban infrastructure
- Background applying computer vision to infrastructure or vehicular data, including: Object detection, Image segmentation, Video or sensor data analysis
- Familiarity with public sector data compliance, security, and governance, such as: Data classification and handling, Access control and audit requirements, Regulatory and policy constraints for government data
- Experience with Unreal Engine in the context of: Real‑world digital twinning, Simulation or immersive visualization of physical environments
- Experience integrating or building solutions with: Google Maps APIs, Cesium or similar 3D mapping/geospatial visualization platforms
- Experience with Polygonflow Dash and its capabilities for: 3D workflows, Visualization pipelines, Automation of complex modeling or simulation tasks
Benefits
- Medical, Dental and Vision Insurance; Wellness Program
- Flexible Spending Accounts (Healthcare, Dependent Care, Commuter)
- Short-Term and Long-Term Disability options
- Basic Life and AD&D Insurance (Company Provided)
- Voluntary Life and AD&D options
- 401(k) Retirement Savings Plan with matching after one year
- Paid Time Off
Company Overview
Cayuse Holdings is an economic enterprise that specializes in providing sourcing and diversity solutions. It was founded in 2018, and is headquartered in Pendleton, Oregon, USA, with a workforce of 501-1000 employees. Its website is https://www.cayuseholdings.com/.