Infrastructure & Systems Engineer
Job Summary
Bucharest, Romania | Hybrid
About the Company
We are building AI-powered industrial systems that combine cameras, GPU servers, networking and real-time software to support high-performance production environments.
We are looking for an Infrastructure & Systems Engineer to own and maintain the infrastructure behind these systems: servers, networking, monitoring, observability, deployment, backups, security and reliability.
This is a hands-on infrastructure role focused on operations, monitoring, troubleshooting and automation.
Core Mission
The mission is to ensure the reliability and stability of production infrastructure, proactively identify failure points and prevent incidents before they impact operations.
The role includes ownership of monitoring, alerting and observability systems, with a focus on detecting hardware, server, storage, networking and performance issues in advance.
Responsibilities
Own and maintain Linux-based infrastructure: servers, GPU machines, networking, storage, backups and remote access
Ensure production systems run reliably through monitoring, troubleshooting and incident response
Build and maintain monitoring, logging, alerting and observability systems
Detect and prevent infrastructure, hardware and network issues before they affect production
Participate in on-call / incident-response rotations
Work with high-bandwidth networking, VLANs, PTP synchronization and rack-mounted infrastructure
Deploy and maintain applications, databases, virtualization services and supporting infrastructure
Automate repetitive operational tasks using scripting, Docker, Ansible, Terraform or similar tools
Manage VPNs, firewalls, certificates, credentials and access control
Create documentation, troubleshooting guides and operational procedures
Use AI tools such as ChatGPT, Claude or coding assistants to accelerate troubleshooting and scripting
Collaborate closely with software, research, hardware and support teams
Requirements
4+ years of experience in infrastructure, systems engineering, Linux administration, DevOps or networking
Strong Linux and networking knowledge
Experience with monitoring, observability and troubleshooting tools
Experience with Docker, databases and virtualization services
Scripting skills (Bash and/or Python)
Experience with Ansible, Terraform or similar automation tools
Good understanding of security, remote access and backup/recovery practices
Comfortable with production incident response and on-call responsibilities
Structured and documentation-oriented mindset
Fluent English
Nice to Have
Experience with GPU servers, ML infrastructure or real-time systems
Experience with industrial environments or high-bandwidth image/video systems
Exposure to cloud infrastructure
Work Model
Hybrid setup (Bucharest-based)
Approximately one day per week in the office
Individual employment contract (CIM) or B2B
To apply, send your CV to office@staffingbysquill.com
