AI Performance Software Engineer Job at Signify Technology, San Francisco, CA

  • Signify Technology
  • San Francisco, CA

Job Description

AI Performance Engineer – CUDA & PyTorch Focus

Location: San Francisco, CA

Compensation: $200,000-$300,000

A stealth-mode AI systems company is reimagining how large-scale inference is done. With generative AI workloads scaling rapidly, inference efficiency has become a critical bottleneck. We're building an integrated hardware-software platform that brings breakthrough performance and usability to production-scale LLM applications.

This is an opportunity to work on a highly technical team spun out of top-tier academic research, focused on the cutting edge of AI, distributed systems, and performance optimization.

What You’ll Do:

  • Drive core research and implementation of performance optimizations for modern AI models
  • Implement advanced techniques like FlashAttention, KV caching, quantization, and model compression
  • Design and build scalable, distributed compute strategies across GPU-based systems
  • Profile, benchmark, and optimize CUDA kernels and AI runtime performance across inference stacks
  • Work across frameworks like PyTorch, ONNX, and vLLM to improve end-to-end efficiency

What We're Looking For:

  • Strong background in CUDA and low-level GPU performance tuning
  • Proven experience building with PyTorch and deploying high-performance ML models
  • Proficiency in Python and C++
  • Experience with large-scale distributed systems in cloud environments (AWS, GCP, or Azure)
  • Exposure to AI compilers or frameworks like MLIR is a plus
  • Interest in system design, scalability, and accelerating LLM workloads in real production environments

If you’ve spent your time making large models faster, leaner, and more efficient—and want to solve hard technical problems at the core of GenAI infrastructure—this role is for you.

Reach out to learn more.
