
201 - 500 employees
Founded 2011
đ€ B2B
đĄ Telecommunications
đ§ Hardware
B2B âą Telecommunications âą Hardware
CodiLime is a software and network engineering services company that partners with networking hardware vendors, software providers, and telecommunications firms to create proofs-of-concept, develop new products, and support production environments. Founded in 2011 and grown to 300+ employees, the company specializes in network automation, low-level systems programming, observability, DevOps, and cybersecurity, serving clients worldwide (US, Japan, Israel, Europe). CodiLime primarily operates as a B2B service provider to tech startups and large industry players.
đ„ 0 minutes ago
đ”đ± Poland â Remote
đ” zĆ16k - zĆ25k / month
âł Contract/Temporary
đ Senior
đ€ AI Engineer
Improve your chances of getting an interview by checking your resume score before you apply.

201 - 500 employees
Founded 2011
đ€ B2B
đĄ Telecommunications
đ§ Hardware
B2B âą Telecommunications âą Hardware
CodiLime is a software and network engineering services company that partners with networking hardware vendors, software providers, and telecommunications firms to create proofs-of-concept, develop new products, and support production environments. Founded in 2011 and grown to 300+ employees, the company specializes in network automation, low-level systems programming, observability, DevOps, and cybersecurity, serving clients worldwide (US, Japan, Israel, Europe). CodiLime primarily operates as a B2B service provider to tech startups and large industry players.
âą Developing MCP-like tools that expose network device APIs and CLI commands with clear descriptions, structured inputs/outputs, validation logic, and error handling âą Managing tool metadata and supporting semantic search over available tools using a vector database âą Creating golden user queries, expected answers, and query variations for specific tools, intents, and network-operation scenarios âą Building automated tests to verify correct tool selection, tool parameterization, output structure, and end-to-end agent responses âą Designing evaluation workflows combining deterministic checks, human review, and LLM-as-a-judge techniques âą Refining prompts, tool descriptions, schemas, and agent workflows while monitoring regressions when new tools or changes are introduced âą Developing production-quality Python code and tests
âą Hands-on experience with LLM-driven workflows, agentic frameworks such as LangChain and LangGraph, and tool-calling patterns âą Experience designing structured tools with clear descriptions, input/output schemas, validation logic, and integration with external APIs or command-based systems âą Experience with semantic search, vector databases, RAG patterns, prompt engineering, and structured LLM outputs âą Experience creating golden queries, automated tests, regression checks, and chatbot/agent response evaluations, including LLM-as-a-judge approaches âą Proven experience developing production-quality Python code, including automated tests and maintainable integration logic âą CCNA certificate or equivalent knowledge. Understanding of networking platforms, device commands, and troubleshooting âą English (B2 level at minimum, but preferably C1 or C2)
âą Flexible working hours and approach to work: fully remotely, in the office or hybrid âą Professional growth supported by internal training sessions and a training budget âą Solid onboarding with a hands-on approach to give you an easy start âą A great atmosphere among professionals who are passionate about their work âą The ability to change the project you work on
Apply Now