DepartmentInfrastructureEmployment TypeFull TimeLocationRemoteReporting ToEvgenii PerepelkinDescriptionTabby creates financial freedom in the way people shop, earn and save by reshaping their relationship with money. Over 15 million users choose Tabby to stay in control of their spending and make the most out of their money. The company's flagship offering allows shoppers to split their payments online and in-store with no interest or fees. Over 40,000 global brands and small businesses, including Amazon, Noon, IKEA, and SHEIN use Tabby to accelerate growth and gain loyal customers by offering easy and flexible payments online and in stores. Tabby generates over $10 billion in annual transaction volume for its partner brands and is the highest‐rated, most‐reviewed, largest, and fastest‐growing Fin Tech in the GCC region. Tabby launched in 2019 and has since raised +$1 billion in equity and debt funding from global and regional investors, and is now valued at $4.5 billion.Key Skills And ResponsibilitiesLLM Serving & Model ManagementDeep expertise in high‐throughput serving using v LLM, NVIDIA Tensor RT-LLM, and sglang to minimize latency and maximize hardware efficiency.Hands‐on experience deploying and optimizing large‐scale open‐weights models, specifically Deep Seek 3.1/3.2, Qwen, and GPT‐OSS variants.Advanced optimization and security hardening of Docker specifically for GPU environments.Managing model weights and orchestration within Kubernetes (GKE) environments.Real‐Time Data Engineering & CDCDesigning and maintaining high‐throughput CDC (Change Data Capture) pipelines using the Apache ecosystem (e.g., Debezium, Kafka) to sync data from Cloud Postgre SQL.Deploying and tuning Click House for real‐time analytics, ML feature storage, and high‐speed logging.Orchestrating complex ML data workflows using Airflow (Google Cloud Composer) to ensure data reliability.Core Infrastructure & NetworkingStrong Linux systems expertise including internals, networking, and performance tuning for large‐scale distributed systems.Experience with Istio service mesh to manage microservices communication and traffic.Provisioning and maintaining dedicated GPU nodes (A100/H100/H200/B200), including driver management and OS‐level tuning using Ansible.Solid Kubernetes expertise: controllers, CRDs, CNI, and Ingress.CI/CD & ToolingImplementing pipelines as code within Git Lab CI, managing runners, caching, and security scanning.Infrastructure as Code with Terraform and Terragrunt.Proficiency in Python/Bash for building custom automation and AI Agent tooling.Load Testing & ObservabilityConducting rigorous load testing for Gen AI applications, focusing on metrics like TTFT, TPS, and RPS.Deploying and managing Lite LLM Gateway for unified API access, load balancing, and cost tracking.Experience with Datadog for monitoring GPU utilization, inference health, and log pipelines.Soft SkillsStrong ownership mindset: balancing speed, reliability, and cost.Comfortable working cross‐functionally with developers, security, and compliance.Excellent sense of responsibility and accountability.English B2 or higher.Nice to HaveExperience with PCI‐DSS, SOC2, or regulations compliance environments.Our Tech StackLinux, Docker, Kubernetes, GCP (GKE, Cloud Postgre SQL), Datadog, Git Lab, Apache CDC, Click House, Airflow, Istio, Terraform, Terragrunt, Ansible, v LLM, Tensor RT-LLM, sglang, Lite LLM, Deep Seek, Qwen, Go, PythonWhat we offerFull‐time B2 B contractFully remote setup, work from anywhere in EuropeUp to 20% tax allowance22 paid leave days annuallyStock options (ESOP) in a fast‐scaling, pre‐IPO companyFlexi benefits you can use for wellness, travel, or learningWork alongside a high‐performing, international engineering team in a global fintech unicornRelocation supportAvailable to our hubs in Armenia, Georgia, Serbia, and Spain, including flights, temporary accommodation, and legal setup.