Find A Job  ▶
Find Talent  ▶

Apply

Senior SRE Engineer

Santa Clara, California - Posted on June 26, 2024
Published By Kelly Champeau

Trillium Professional is now seeking Senior SRE Engineers in Santa Clara, CA!
 

Pay rate is $75 - $90/hour, depending on experience. Our client is looking for a seasoned SRE to join its multifaceted and fast-paced Infrastructure, Planning and Processes organization where you will be working as a Senior SRE Engineer. The position will be part of a fast-paced crew that develops and maintains sophisticated internal cloud provisioning product for GPUs and Tegra systems. The team works with various other business units within the company Software such as Graphics Processors, Mobile Processors, Deep Learning, Artificial Intelligence and Driverless Cars to cater to their infrastructure & systems needs.

As an SRE, you’ll also be working in conjunction with various teams such as software engineering to deploy these new products and manage our infrastructure, associated processes and systems. Keen attention to detail, problem-solving abilities, and a solid knowledge base are essential.

What you’ll be doing:
-Working on systems deployed in company’s internal cloud making them available and reliable for our end users.
-Monitor system performance and troubleshoot issues related to CPU, memory, disk, and network utilization.
-Providing high quality of user support.
-Monitoring KPIs and making sure that team’s SLAs are met.
-Managing and maintaining production Kubernetes clusters.
-Drive automation of monitoring to gain more insight into applications and system health.
-Craft and develop tools needed for automating workflows.
-Develop, Improve and Maintain our infrastructure codebase.
-Craft and implement critical metrics using various analytics methods and dashboards.
-Take part in prototyping, crafting, and developing cloud infrastructure 
-Reuse AI techniques to extract useful signals about machines and jobs from the data generated.

Apply now!

What we need to see:
-Experience of maintaining cloud infrastructure and highly available production environment.
-Experience managing systems installed data centers. Proficient with BMC (Redfish), KVM, and IPMI tools.
-Working knowledge of Openstack.
-Background in Databases like SQL (MySQL) and timeseries DBs like Prometheus.
-Strong knowledge of networking principles and protocols, including TCP/IP, DNS, DHCP, and VLANs.
-Experience with data analytics/visualization tools like Kibana, Grafana, Splunk etc.
-Strong Ansible skills. Experience with Ansible AWX.
-Strong background with Jenkins and/or other CI/CD systems.
-Proficient with Kubernetes, dockers & virtualization.
-Proficient using source code management and binary repository systems like GitLab, GitHub, Artifactory, Perforce etc.
-Knowledge of monitoring systems such as Zabbix, Prometheus, -PagerDuty and/or similar systems.
-Advanced knowledge of standard methodologies related to security.
-5+ years of proven experience.
-Bachelor’s degree in Computer Science, Information Technology, or related field, or equivalent experience.

Ways to stand out from the crowd:
-Previous experience with SRE teams managing on-prem infrastructure.
-Experience managing company hardware like GPUs and Tegras.
-Thrives in a multi-tasking environment with constantly evolving priorities.
-Prior experience with large scale operations team.
-Experience with Windows server infrastructure.
-Outstanding interpersonal skills and communication with all levels of management.
-Experience with using and improving data centers.
-Ability to analyze sophisticated problems into simple sub problems and then reuse available solutions to implement most of those.
-Ability to design simple systems that can work efficiently without needing much support.


Trillium has been recruiting and placing clerical and office professionals for over 30 years. From Fortune 100 companies to small businesses, our philosophy remains the same: to achieve excellence by providing quality employees with an uncompromising level of service. We believe in honesty, integrity, and a simple philosophy of providing value to our customers and our employees. We strive to be unsurpassed in the recruitment and placement of professionals. Trillium is an Equal Opportunity Employer.

By applying to this job, I agree to receive electronic communications including SMS text and email regarding future opportunities, referral bonus incentives, and other promotions from Trillium. You may opt out at any time from future communications by responding STOP to any electronic communication. You may view our full privacy policy at https://trilliumstaffing.com/jobs/privacy/.

Trillium offers a comprehensive benefit package that includes the ability to participate in health insurance and retirement plans, paid holidays, state required leave, and vacation days. Trillium’s offerings are dependent on the state in which the assignment is located, length of time worked, and may change depending on assignment. Benefit packages for direct hire placements vary based on the client company.

Qualified applicants with arrest or conviction records will be considered for employment in accordance with the Los Angeles County Fair Chance Ordinance and the California Fair Chance Act.

Want to apply for Senior SRE Engineer?

  • To apply for Senior SRE Engineer enter your email address below.

  • If you have an account with indeed.com, you can also

       

      Contact Us if you have any questions


      Contact

      Our intentions are to fill job vacancies as quickly as possible with qualified candidates. We are always accepting applications if a time sensitive job has an application deadline it is noted in the job description. Click on "Apply" to begin the apply process.

      Logo
      They have consistently met our needs by providing qualified employees in a very prompt time frame.
      Mark