Wednesday, August 28, 2024

[xrecnet] Site Reliability Engineering (SRE) with Java, Splunk at San Leandro, CA - Need Locals (USC/Citizens)

I can submit one quality profile who is GC/Citizen and local to California.

 

 

 


Location: Candidate must be willing to come to the San Leandro/SF office 3 days a week. The primary location is the San Leandro/SF office.

 

 

Job Description: I am also open to candidates with a good Java Dev background who are interested in this role and have strong hands-on experience in building dashboards and setting up alerts using Splunk, Grafana and GCL.

 

Skill (Primary): Modern Application Development-DevOps (Modern AD)-Site Reliability Engineering (SRE)

I am looking for a strong, hands-on Site Reliability Engineer with the skill set below.

 

Please share resumes of candidates with a Site Reliability background, or a Dev background, who are open to this role.

Note: Need an SRE candidate with a good Java Dev background who is interested in this role and has strong hands-on experience in building dashboards and setting up alerts using Splunk, Grafana and GCL.

 

 

Required Qualifications:

10+ years of Software Engineering experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education.

10+ years of experience in Production Support/Site Reliability Engineering teams with a continued focus on improving platform health.

Familiarity with Agile or other rapid application development practices.

Hands-on expertise with automated testing, process automation, and building dashboards using APM tools.

Experience with distributed (multi-tiered) systems, algorithms, relational databases, and NoSQL databases.

Knowledge of and exposure to caching tools (Redis, Memcached) or messaging tools such as MQ and Kafka.

Must have working knowledge of APM tools such as Splunk, GCL, ELK, Grafana, Prometheus, etc.

Able to create dashboards using GCL/Splunk/ELK and set up alerts.

Working knowledge of CI/CD is a plus: source control such as Git, and continuous integration/release tools such as Jenkins and UCD.

Ability to work with engineering teams across the ecosystem, such as Security, Networking and Infrastructure, on challenges that can impact platform health and resiliency.

Shell scripting and DevOps tools such as Ansible, with good knowledge of YAML for writing playbooks.

Experience with distributed storage technologies such as NFS, as well as dynamic resource management frameworks such as PCF, Kubernetes/OpenShift, AWS or Azure.

Tech Stack: Java/J2EE (Spring, Spring Boot), Python, Shell Scripting, Kafka, Oracle, MongoDB, etc.

Able to work on shift duty in a 12/7 support organization.

A proactive approach to spotting problems, areas for improvement, and performance bottlenecks.

Bachelor's degree in computer science, computer science engineering, or related experience required.

 

 

Job Expectations:

You will be a core member of an SRE support team and will use the latest technology tools to write code and test cases, work with API specs, and automate to maintain the resiliency, performance and availability of Digital Sales & Marketing platforms.

Strong, relevant experience in supporting Web/API platforms built using a Java/JavaScript stack (Spring/Spring Boot, JavaScript with Angular/React).

Proficiency in dealing with legacy infrastructure along with cloud infrastructure (on-prem and third-party) such as PCF or Azure.

Identify opportunities to adopt new technologies while improving efficiency by removing toil, and continue to drive efficiency and optimization.

Proactive monitoring of app performance through Splunk, app dashboards, AppDynamics, Dynatrace, etc.

Represent Platform Engineering teams during production outages and collaborate with engineering teams to resolve them. Collaborate with stakeholders across the engineering function to own/derive RCA and work towards permanent resolution.

Plan, support, execute and comply with governance programs/processes in support of a strong control environment in your functional area. Leverage process documentation to improve operational controls and identify and remediate process deficiencies.

Proactively identify, communicate, mitigate and escalate risk originating from non-compliance with processes, operational errors, and data integrity issues in all applicable processes.

Ability to influence SRE practices within and outside teams to enable a strong DevOps culture within the organization

Able to work on shift duty in a 12/7 support organization.

Responsible for working with Engineering teams to maintain SLAs and SLOs. Constantly look for opportunities to improve platform metrics and communicate them to stakeholders.

Exposure to and proficiency in different API styles such as SOAP, REST, microservices, etc.

Working knowledge of Unix, Linux and Postman

 

 

 

 

 

 

 


Regards

Pavan

VDart Inc

Ph: (470) 251-2584 Ext:1866

Email: Pavankumar.s@vdartinc.com

Website: https://vdart.com

 


 


 

 
