Position title
Site Reliability Engineer
Description

Viafoura partners with over 600 media brands to engage, convert and monetize their digital audiences. With best-in-class engagement and content moderation solutions — including real-time conversations, live blogs, community chat, personalization tools and AI-powered moderation — Viafoura helps companies create active, civil and loyal online communities. Advanced data analytics also offer customers access to unique and valuable insights into their audience's behaviors and preferences. As a result, the Viafoura solution drives higher registrations and subscriptions as well as better-targeted content and advertising.

We’re looking for a Site Reliability Engineer who is passionate about CI/CD, continuous improvement, and building high availability and reliability of our platform.  We’re looking for someone to bring in fresh ideas, who demonstrates a unique and informed viewpoint and enjoys collaborating with cross-functional teams including Product, Software Engineering, Customer Success and Data Science.

Our philosophy is to do the right thing at all times. We push boundaries to deliver a distributed, real-time API that handles tens of thousands of requests per second. We’re a dynamic and innovative team building a global, real-time platform utilizing modern technologies like Kubernetes, Docker, Java 11, Kafka, and Elasticsearch. As such, we’re on the hunt for team members looking to grow as an engineer and as a leader. If this sounds appealing, then read on!

Experience:

  • 3+ years experience in SRE, Infrastructure, and/or DevOps roles
Responsibilities
  • Run the production environment by monitoring availability and taking a holistic view of system health
  • Build software and systems to manage platform infrastructure and applications
  • Improve reliability, quality, and time-to-market of our suite of software solutions
  • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve our systems, processes, technologies and tooling
  • Provide primary operational support and engineering for multiple large distributed software applications
  • Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding
  • Partner with development teams to improve services through rigorous testing and release procedures
  • Participate in system design consulting, platform management, and capacity planning
  • Create sustainable systems and services through automation
  • Balance feature development speed and reliability with well-defined service level objectives
Qualifications

 

  • Bachelor’s degree in computer science or equivalent experience
  • Experience with AWS preferred, other cloud vendors (GCP, Azure..) acceptable
  • Experience with kubernetes, helm, docker
  • Experience with Terraform, CloudFormation or AWS CDK
  • Experience using MySQL, other RDBMS acceptable
  • Experience using Linux, including shell scripting
  • Ability to program (structured and OO) with one or more high level languages, such as Python, Go, Java, and/or JavaScript
  • Experience developing high throughput, low latency systems
  • Experience with Gradle, Maven, Jenkins, Git
  • Well-versed in Continuous Integration and Continuous Delivery methodologies
  • Experience with Agile delivery methodology
  • Strong command of English language (both verbal and written)

 

Nice to haves:

  • Familiar with observability and monitoring (eg. AWS CloudWatch, DataDog)
  • Familiar with Microservices architecture
  • Appreciation for performance optimizations, especially in latency-sensitive (millisecond scale) environments
  • A proactive approach to spotting problems, areas for improvement, and performance bottlenecks
  • Ability and desire to learn different skills outside of their domain of expertise
  • Ability and willingness to tech skills and share experiences with other team members
Job Benefits
  • Competitive Compensation
  • Comprehensive benefits
  • Professional Development
  • A highly supportive and collaborative Agile team culture
  • Choice of technology: Mac, Linux, PC and Abacus (limited)
  • A collaborative and open workspace
  • Full remote work option
Employment Type
Full-time
Job Location
Remote work possible
PDF Export
Close modal window

Position: Site Reliability Engineer

Thank you for submitting your application. We will contact you shortly!