ROLE DESCRIPTION

Location: Philadelphia, PA

Lead Site Reliability Engineer

As a Lead Site Reliability Engineer, you are someone on a mission to design and automate performant and highly available infrastructure, and leading people to work the same way. That's great, because it fits perfectly into our view of infrastructure: adaptive, flexible, reconfigurable to the needs of a dynamic business.  We are Power Home Remodeling, a nationwide home remodeler that thrives in a technology-first world. We need your help to make us faster and more dynamic while delivering a technology platform that never stops.

 

Power Home Remodeling is an established, profitable, and growing company based out of Philadelphia. As the technology department of this company, our mission is to deliver a platform for applications, enabling developers to rapidly iterate, supporting a changing business as quickly and efficiently as possible. To date we've put a lot of effort into making our server infrastructure automated with tools like Puppet, Terraform, and Consul. We are now moving into the next generation with containers, enabling rapid development of new services. We're also growing: as we continue to open new territories, we want to ensure that the system capacity is managed for the increased load and monitored for consistent availability. This is where you come in.

 

If you're the ideal candidate, then these are skills that you either know today or are excited to learn:

  • Private cloud technologies, especially Kubernetes & Docker
  • Virtualization technologies like VMware or OpenStack
  • Programming in at least one language, preferably Ruby, Python, or Go
  • Continuous Integration and Continuous Delivery
  • PaaS and/or IaaS
  • SDN and/or NFV
  • Multi-site WAN links, fiber and VPN
  • Routing & switching, wireless networks
  • Operational experience in high-availability environments
  • Appreciation for agile methodologies such as Scrum
  • Technical leadership and mentoring

 

Duties and Responsibilities

  • Drive development of important, revenue-supporting projects while maintaining a stable, mission-critical infrastructure
  • Work as part of a team on stories as described in our agile workflow management tool
  • Participate in our Scrum process with weekly planning, daily standups, and weekly demos
  • Take feedback from testers and users to identify and resolve infrastructure problems
  • Employ automation tools like Puppet and Ansible to build out new servers and provision networking
  • Handle escalated support requests for infrastructure issues
  • Work a rotating On Call shift to handle after-hours requests
  • Diagnose application failures in staging and production environments, understand the underlying issues, and resolve them
  • Maintain knowledge of networking tools and best practices
  • Sharing your knowledge with teammates, increasing the team's capacity through shared expertise


Technology
Our stack currently includes the following technologies, but is constantly evolving. Knowledge of these specific technologies is preferred, but we’re equally interested in someone who can learn new technologies quickly, perhaps being the first on the team to use them.

  • Network Platform: Cisco Nexus
  • Server Operating System: Ubuntu Linux
  • Virtualization: Kubernetes/Docker, VMware
  • Storage: NAS, DAS, Object Storage (S3/NetApp)
  • Automation: Ansible, Puppet, Jenkins, Terraform
  • Web application: Ruby on Rails
  • Databases: MySQL, Redis, LDAP, ElasticSearch
  • Telephony: Adhearsion, Asterisk
  • Messaging & PubSub: XMPP (ejabberd), Redis
  • Organizational tools: Scrum, Pivotal Tracker, Git, Github


Education and Experience

  • 5+ years of relevant industry experience, especially with Linux
  • Proven track record of delivering incremental value continuously
  • Experience leading on small teams desirable

Location
Corporate Headquarters, outside of Philadelphia, PA

 

Benefits

  • Competitive salary
  • Full medical, dental, life and disability insurance plans that can be tailored to your specific needs and the needs of your family
  • A competitive 401(k) retirement savings program matched by Power
  • All the tech you need - we'll pay for whatever hardware and software you need to work and make sure you're regularly upgraded to the latest versions
  • Personal development - we provide books, courses and conferences
  • Paid parental leave - when the time comes to welcome a new member of the family, we offer paid parental leave
  • Paid volunteer time off
  • 3 events per year focused on internal development and improvement
  • Artfully designed office space
  • Free food and entertainment every first Friday of summer

 

 

Accolades for Power
One of FORTUNE's Top 100 Companies to Work For (2019)
#15 Glassdoor.com Employee’s Choice Best Place To Work (2019)
#6 Workplace for Millennials by Fortune Magazine (2018)
#4 Entrepreneur’s Top Company Cultures List for Large Companies (2018)
Tim Wenhold - CIO of the Year Finalist (2018)
#17 ComputerWorld Best Places To Work in IT (2017)
One of Inc. 5000’s Fastest Growing Private Companies
Philly.com Top Workplace for Midsized Companies

APPLY NOW

Max file size 10 MB.