LLM Inference Engineer

NEW

Veqto · Remote · US/EU · Senior · 1h ago

$220k–$300k salary band

US/EU timezone

Senior level

vLLM · CUDA · Rust

the role

Make our serving stack faster and cheaper — kernels, continuous batching, speculative decoding, end to end.

what you'll own

requirements

about Veqto

Veqto serves open-weight models to developers. Profitable, ~30 people, infrastructure-obsessed.

more remote roles

NEW

Tensor Harbor · Remote · US/EU · $185k–$245k · 1h ago

Elasticsearch · pgvector · Python

NEW

Sebble · Remote · Global · $170k–$230k · 1h ago

OpenAI · LangChain · TypeScript

NEW

Looplytic · Remote · US/EU · $190k–$260k · 1h ago

Kubernetes · Triton · Python