👉 Subscribe to find your next opportunity by joining +5,000 remote workers and get 140 offers per week 🌎

Senior Site Reliability Engineer
Happy Money
Publication date: Nov 6th
Job type: Full Time
Category: Software Dev
View all Happy Money jobs

ABOUT THE ROLE


WE ARE OPEN TO 100% REMOTE CANDIDATES FOR THIS POSITION.


  • Design and write software to automate all things (even our one-offs are automated)
  • Become proficient in understanding how each software component, system design, and configuration is linked together to form an end-to-end solution
  • Align engineering development requirements with the capabilities of the infrastructure
  • Anticipate future infrastructure needs and offer solutions
  • Participate in the design, implementation and ongoing management for both software development and infrastructure operations
  • Solve infrastructure and development issues ranging from simple configuration changes to complex multi-variable performance problems
  • Drive requirements for cross-department automation and tooling
  • Serve in an on-call team and as an escalation contact for service trouble incidents
  • Design optimizations to meet the scalability and performance needs of the organization
  • Offer assistance in scaling and optimizing build and continuous integration systems
  • Enhance existing monitoring and reliability metrics across our platform
  • Participate in all phases of the software development life cycle, including deployment and support
  • Maximize software agility, maintainability, and extensibility
  • Minimize the cost of change, feedback time, and time to recover from problems


ABOUT YOU

  • 7+ years experience of Linux administration, configuration, and in-depth troubleshooting (Unix/Linux RHEL/CentOS or Ubuntu)
  • 5+ years administering mission-critical and large-scale, Internet-facing web applications
  • 2+ years with AWS, Azure, Google Compute, or similar “cloud” IaaS provider
  • 5+ years of system monitoring using (Nagios, Splunk, NewRelic, Sensu, etc.)
  • 3+ years of cluster administration (PostgreSQL,MySQL,Kafka/Spark/Hadoop/Mongo)
  • 3+ years of experience with Chef, Puppet, Ansible, SaltStack, or similar automation framework
  • Very strong programming ability in at least one scripting or shell language, such as Python, Ruby,  or Perl and Bourne/Bash shell script
  • Experience with Agile, Lean, and/or  test-driven Software development environments
  • Proven ability to scale Internet applications and systems horizontally
  • Experience with and daily use of SCMs, particularly Git
  • Disaster recovery planning and implementation
  • HandsOn Experience with networking for a cloud-based Internet application (load balancing, reverse proxies, DNS, CDN’s,  firewalls, security applications)
  • Have a great attitude and be ready to hustle


BONUS POINTS FOR

  • 2+ years writing recipes with Chef and/or test-kitchen integration tests
  • 2+ years of experience with continuous integration servers, such as Jenkins or Bamboo
  • Experienced with Information Security compliance, including SOC-2 or PCI compliance preparation
  • Experience with financial systems or Loan Origination Systems
  • Hands-on experience with Apache Kafka, Spark and/or Hadoop Stack
  • Hands-on experience with SQL and NoSQL DB variants, i.e. PostGres, MySQL, Mongo, Cassandra


WHY WE'RE AWESOME

  • Rich employee medical benefits offering—most paid 100% by Happy Money!
  • Unlimited vacation policy!
  • Unlimited snacks, coffee, teas—whatever you’re into, we’ve got it!
  • Weekly Happy Money Hour—we love to mingle and enjoy great brews!
  • Immigration sponsorship for qualified candidates

Please mention that you come from GetRemotify when applying for this job.