NVIDIA 2 months ago
Deep learning

NVIDIA | vLLM + SGLang | Deep Learning Inference | Remote (North America preferred)

Hi everyone — I’m Akbar, Senior Manager of Deep Learning Inference Software at NVIDIA. I lead our engineering efforts around vLLM and SGLang, two of the most widely used open-source LLM inference frameworks.

We’re building teams focused on making LLM inference faster, more efficient, and more reliable at scale — from runtime and scheduling optimizations to kernel fusion, distributed serving, and continuous integration across new GPU architectures (Hopper, Blackwell, etc.).

We’re hiring for multiple roles:

• Senior Deep Learning Software Engineer, Inference (https://nvidia.wd5.myworkdayjobs.com/nvidiaexternalcareersit...)

• Engineering Manager, Deep Learning Inference (https://nvidia.wd5.myworkdayjobs.com/nvidiaexternalcareersit...)

• DL Performance Software Engineer - LLM Inference (https://nvidia.wd5.myworkdayjobs.com/en-us/nvidiaexternalcar...)

• DL Performance Software Engineer - LLM Inference (https://nvidia.wd5.myworkdayjobs.com/en-us/nvidiaexternalcar...)

These roles are remote-friendly (North America preferred) and fully focused on upstream open-source development — working directly with the maintainers and the wider AI community.

If you’re excited about large-scale inference, compiler/runtime performance, and pushing GPUs to their limits, we’d love to talk.

Remote
Poesis 2 months ago
Machine learning

Poesis (https://poesis.ai), Founding Engineers (ML, Quant, Head of Eng), On-Site (Palo Alto, CA), Full Time

Poesis is the AI-native investment manager pioneering a new foundation model for investing in U.S. equities. We’re building AI systems to predict market movements and outperform legacy managers. This is frontier research with immediate real-world validation: your work directly shapes investment decisions and portfolio performance.

We’re hiring founding technical roles to build the first generation of our trading and ML systems:

Head of Engineering: own architecture, pipelines, and productization of research; define the technical backbone of the fund.

Founding ML Engineer: build end-to-end ML systems for data ingestion, training, backtesting, and signal generation.

Founding Quant Developer: turn research ideas into production-grade code, working closely with leadership on real trading systems.

We’re a small, deeply technical team based near Stanford, working several days a week on-site in Palo Alto. Relocation support available.

If you’re excited by the intersection of AI, finance, and first-principles design, we’d love to meet you.

Apply here: https://jobs.ashbyhq.com/poesis?utm_source=7b1zo0bvxd

Onsite Full-time
Silk Hedge Fund 2 months ago

Silk Hedge Fund | https://silk.fund | Co-founder | In-person NYC

Silk is a fully autonomous, full-stack AI hedge fund that uses large language models to orchestrate the complete spectrum of hedge fund operations, from quantitative research and technical analysis to trade execution, position management, and portfolio oversight. Silk is being designed to support several hundred autarkic cloud-based LLM instances that can each perform their own real-time market analysis and trade stocks on a custom recurrence schedule, with custom MCP tools and infrastructure, using initial conditions (configurable strategies) set by humans. The end state for Silk is a multi-prop hedge fund that invests in financial markets with zero humans in the loop.

I'm currently a solo founder working on this full time (2 months). Please reach out if interested. Also looking for advisors / mentors.

Alek Turkmen | Email is hidden | https://www.linkedin.com/in/alekturkmen/ | https://alekturk.men

Full-time
Two Dots 2 months ago
BigQuery Google Cloud GraphQL Machine learning Node.js PostgreSQL React.js

Two Dots | https://twodots.net | Engineers, AEs, CSMs | Onsite in SF | Full-time

Fully automated agentic consumer finance chat bot / betting on consumer ability to repay / catching criminals at scale with forensics

Full Stack - GCP / BigQuery / Postgres / Node.js (Prisma/GraphQL) / React

ML - Torch

https://jobs.ashbyhq.com/two-dots

Onsite Full-time
Yuzu 2 months ago
Next.js

Yuzu | https://yuzu.health | Engineers & Designers | Onsite in NYC | Full-time

At Yuzu, we’re building the next-generation health insurance company. We are NOT building a digital brokerage or an AI wrapper - we are going deeper to build the foundational infrastructure required to power tomorrow’s health plans.

We're hiring aggressively across the board; see open roles at https://yuzu.health/careers

Onsite Full-time
Suno 2 months ago
Android Django IOS Kotlin Next.js Python React.js TypeScript

Suno | Cambridge, MA | New York, NY | Full-time | Onsite

We are building a future where anyone can make music. We’re scaling our engineering team and hiring for the following roles:

- Software Engineer, Android - Jetpack Compose, Kotlin

- Software Engineer, iOS - SwiftUI

- Software Engineer, Fullstack Web - Typescript, React, NextJS, Python, Django

- Software Engineer, Growth - all the things

- AI Researchers

- Other roles here https://jobs.ashbyhq.com/suno

Apply at the link above or feel free to shoot me an email at Email is hidden if you have any questions or want to learn more.

Onsite Full-time
Pelairo Inc. 2 months ago
AWS Cognito Nest.js Next.js Node.js React.js TypeScript

Pelairo Inc. | Principal Engineer / Tech Lead | REMOTE (US) | Full-time

At Pelairo we're building the most intuitive and flexible Laboratory Information Management System (LIMS) on the market. "Manage the lab, not the LIMS!"

We're looking for a hands-on Principal Engineer to lead the design and delivery of our backend—built on TypeScript + NestJS—and help our small team iterate quickly with confidence. You'll turn ambiguity into shippable slices, help set technical direction, and raise the bar on reliability, security, and speed.

What you'll do:

- Own the backend: spend meaningful time writing and reviewing production code.

- Architecture stewardship: keep the system coherent as we scale questionnaire-driven workflows and FHIR pipelines.

- Reliability & quality: Establish testing strategy (unit/integration/contract/E2E), migrations, rollouts (feature flags).

- Observability: Instrument tracing/metrics/logging, build dashboards/alerts; drive incident reviews.

- Raise the engineering bar: define and enforce code review policy, branching model, testing strategy, and CI/CD gates.

- Partner with Product & domain experts: translate healthcare workflows into configurable front-end modules; ensure back-end transforms to valid FHIR in Medplum.

What makes you a great fit:

- 8+ years building production systems, including principal/staff scope in a startup or similar pace.

- Deep TypeScript + Node.js and expert-level NestJS in production.

- Track record leading cross-team projects and influencing architecture without heavy process.

- Product sense and crisp communication; comfortable owning ambiguous problems and shipping quickly with guardrails.

- Bonuses: FHIR/Medplum, healthcare data workflows, HIPAA/SOC 2 experience, comfortable jumping into modern frontend (React/Next.js) when needed.

Stack: TypeScript (Next.js and NestJS), Medplum, AWS (Cognito, Fargate)

Visa/relocation: No

How to apply: Email hiring [@] pelairo.com and mention "HN Who's Hiring November 2025". (We know the website needs some work, but the product is our priority right now.)

Remote Full-time
Railway 2 months ago
Bash PostgreSQL

Railway | Product Design, Customer Success Engineer, Product Eng (full stack) | REMOTE (Worldwide) | https://jobs.ashbyhq.com/railway?utm_source=daymzwzj0p

Tired of trying to beat kube into shape? Does writing YAML to ship code fill you with utter dread? Dream of a future where deploying software is simple, and you don't need an army of infrastructure engineers to build that perfect janky bash script™ to make life easy?

We're Railway, and we think infrastructure can be better. So far we've built out a platform loved by hundreds of thousands of users who simply tell us "Give me Postgres", "Deploy this repo", and we make it happen.

Fair warning! The problems are complex: home-rolled hypervisors, cut-above container orchestration, over/under/whateverlay networks, virtio device drivers, edge proxies, IAM that doesn't suck, kitchen sinks - we need to build it and we're looking for like-minded individuals who think this stuff is fun.

Three open roles (apply below!):

+Product Designer:
-Apply here: (https://jobs.ashbyhq.com/railway/6fb07755-acd8-4400-9de3-fa5...)

+Customer Success Engineer:
-Apply here: (https://jobs.ashbyhq.com/railway/dbc51554-cdb8-47d5-8071-5a9...)

+Product Engineer (full-stack):
-Apply here: (https://jobs.ashbyhq.com/railway/6ddcfe47-6cce-469b-ba6d-4f0...)
-Blog post about the team: (https://blog.railway.app/p/team-spotlight-product-engineerin...)

See you soon, and happy shipping.

Remote
Zenact AI 2 months ago
AWS Docker Golang Java Python

Zenact AI | Founding Engineers & Interns | Full-Time + 6-Month Internships | Onsite Bangalore | Location flexible for internships (India)

Tech: Golang • Python • AI Agents

At Zenact AI, we are building AI agents that test apps like real users. I personally faced this problem at Zomato for over 6 years while handling many bugs and incidents.

We launched recently and already have 35+ signups from leading unicorns & soonicorns in India.

Backed by the Zomato mafia.

The team comes with deep expertise from Zomato’s scale journey.

## Roles:

* Founding Engineers.

* Interns (6 months). Must’ve built serious projects or freelanced early in college.

## Tech Stack:

Golang, Python, Java (5%), Appium, AWS, Docker

## You’ll work on:

* Building the platform based on active customer feedback, with a heavy focus on improving the end-to-end latency of testing.

* Fine-tuned vision & reasoning models (currently 92% accuracy vs SOTA ~60%)

* AI agents for mobile testing and reasoning flows.

Apply: send an email to Email is hidden or fill out the form here: https://forms.gle/ywpromowha4zf4gv6

Onsite Full-time