Senior or Lead Site Reliability Engineer (SRE), US Citizen -TS/SCI Clearance Re
Herndon, VA  / Charleston, WV  / Reston, VA  / McLean, VA  / Virginia Beach, VA ...View All
View Less
Share
Posted 21 days ago
Job Description

To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.

Job CategoryProducts and Technology

Job Details

PLEASE NOTE: Qualification for this job is contingent upon acceptable results from a background investigation as well as your obtaining and maintaining the specific level U.S. government background investigation required for this role.

Title: SRE (level based on experience)

Location: Onsite at client, Northern Virginia

Cloud: Public/GovCloud-Blackjack

Salesforce is seeking an engineering candidate to join the Site Reliability organization. Working closely with counterparts in the Infrastructure and R&D organizations, this organization provides a team of engineers monitoring cloud service availability and ready to swiftly repair any service-impacting issues. Seven days a week, 24 hours a day, the Site Reliability team keeps the Salesforce cloud and our customers protected. As a member of the Site Reliability team, you will be responsible for the primary task of detecting and resolving incidents within minutes. This objective is met by monitoring the services, reacting to problems, and proactively addressing issues before they affect performance or availability.

The team is responsible for fire prevention through monitoring, automation, self-healing and resiliency initiatives, destructive testing, and game day exercises. The incumbent in this role would demonstrate a strong focus on tactical operations, as well as large-scale production engineering and orchestration.

Role Description:

  • Keep the customer-facing services available at top performance by maintaining the constant health of the supporting systems.

  • Incident management - Act in key support roles during major incidents e.g. Sev0, Sev1. Also, participate in the technical review of the incident for problem management

  • Problem Management - populate and participate in RCAs and hand them off to the Global Solutions team

  • Ensuring that work carried out by the Site Reliability team is executed in such a way as to comply with the company's internal compliance policy and directives

  • Being available to discuss and resolve technical issues and escalations with other technical staff as the need arises

  • Work with and lead other members of the team in staying on top of key industry innovation and technology, and assist in team development growth

  • Ability to operate in the fast paced environment and troubleshoot complex issues quickly successfully balance multiple priorities

  • Work to automate detection and resolution of recurring issues in the production environment

Basic Requirements:

  • Active TS/SCI clearance

  • Systems engineering experience in enterprise scale internet service engineering or support role

  • Expertise in TCP/IP related technologies (networking protocols, network programming, etc.)

  • Expertise in CLI enterprise support of Unix variants (Linux/Solaris/BSD) as well as strong Linux/UNIX knowledge with significant exposure to Red Hat Enterprise Linux and Solaris

  • Strong understanding of monitoring implementations and administration

  • Strong communication skills (Written and Oral)

  • Past experience in Incident Management and good understanding of ITIL service operations

  • Experience in working in a 24/7 team managing large data centers

  • Be available to work shift work if required

  • Experience provisioning, operating, and managing AWS/C2S based infrastructure and systems

  • Understand and have experience with writing scripts in Python, Go, or other languages


Preferred Qualifications:

  • BS or higher degree preferred in Computer Science or Electrical Engineering plus relevant job-related experience

  • Prior Chef/Puppet or automated deployment experience

  • Prior Jenkins/Bamboo/Spinnaker pipeline execution experience

  • Experience in supporting and maintaining a monitoring and alert systems

  • Experience in supporting and maintaining Java applications

  • Hands on experience configuring and managing AWS (Amazon Web Services), using the CLI/SDKs

  • Experience managing systems monitoring and alerts.

  • Have or Obtain Certifications in Linux+, RedHat and AWS

  • Experience in supporting and managing Kubernetes based applications and services

  • Familiar with Agile Process and DevOps

*LI-Y

Qualification for this job is contingent upon acceptable results from a background investigation as well as your obtaining and maintaining the specific level of U.S. Government security clearance required for this role. U.S. citizenship

Accommodations

If you require assistance due to a disability applying for open positions please submit a request via this .

Posting Statement

At Salesforce we believe that the business of business is to improve the state of our world. Each of us has a responsibility to drive Equality in our communities and workplaces. We are committed to creating a workforce that reflects society through inclusive programs and initiatives such as equal pay, employee resource groups, inclusive benefits, and more. Learn more about Equality at Salesforce and explore our benefits.

and are Equal Employment Opportunity and Affirmative Action Employers. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender perception or identity, national origin, age, marital status, protected veteran status, or disability status. and do not accept unsolicited headhunter and agency resumes. and will not pay any third-party agency or company that does not have a signed agreement with or .

Salesforce welcomes all.


Salesforce.com and Salesforce.org are Equal Employment Opportunity and Affirmative Action Employers. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender perception or identity, national origin, age, marital status, protected veteran status, or disability status. Headhunters and recruitment agencies may not submit resumes/CVs through this Web site or directly to managers. Salesforce.com and Salesforce.org do not accept unsolicited headhunter and agency resumes. Salesforce.com and Salesforce.org will not pay fees to any third-party agency or company that does not have a signed agreement with Salesforce.com or Salesforce.org.

 

Job Summary
Start Date
As soon as possible
Employment Term and Type
Regular, Full Time
Required Experience
Open
Email this Job to Yourself or a Friend
Indicates required fields