About the job
Must-Have Experience
- 3.5+ years backend development in Python.
- FastAPI (preferred) or Flask with Pydantic models;
OpenAPI/Swagger API design.
- Async programming (async/await), concurrency patterns, connection pooling, and
backpressure management.
- Database expertise with Amazon RDS (SQL Server/PostgreSQL) and Redis;
schema design, indexing, query optimization;
migrations with Alembic.
- Event-driven architecture experience using Amazon SQS and Amazon SNS.
- API security and integration design with OAuth2/OIDC, JWT, and rate limiting.
- Containerized application development with Docker (deployment managed by
platform team on Amazon EKS).
- Testing: pytest, fixtures, mocks/stubs, contract tests, and load testing (k6/JMeter).
- Observability: OpenTelemetry, Datadog, structured logging, and actionable alerts.
- Solid understanding of secrets and identity management with AWS Secrets
Manager and Okta.
Nice to Have
- Built streaming chat endpoints (SSE/WebSockets) and function/tool-calling
adapters for AI services.
- Worked with OpenSearch Serverless, Bedrock Knowledge Base, or other vector
databases for RAG workflows.
- Experience integrating backend systems with AWS Bedrock models and NeMo
Guardrails for runtime safety.
- Exposure to Kong API Gateway, feature flags (LaunchDarkly/Flipt), or policy-as-
code (OPA).
- Multi-tenant controls (RBAC, quotas, usage metering) and enterprise-grade
integration patterns.
- Familiarity with enterprise APIs (SAP OData/BAPI, Salesforce REST/Graph,
ServiceNow).
Tech Stack (our core;
equivalents welcome)
- Python 3.11+, FastAPI, Pydantic v2, SQLAlchemy 2.X, Alembic, pytest.
- Amazon SQS/SNS for messaging and events.
- Amazon RDS (SQL Server/PostgreSQL) for relational data, ElastiCache (Redis) for
caching.
- AWS Bedrock for model hosting and RAG services.
- Amazon S3 for data storage, Amazon ECR for container images.
- OpenSearch Serverless for vector storage and search.
- AWS Secrets Manager, Okta IAM for authentication and secrets.
- NeMo Guardrails on Amazon EKS for runtime safety.
- Datadog + OpenTelemetry for observability and monitoring.
- Kong API Gateway for service routing and access control.
- Custom RAG Service and LLM evaluation using Phoenix/Arize/Promptfoo.