Thursday, April 8, 2021

[xrecnet] Direct Client : Hadoop Admin - Architect - Remote

 

 

 

Hi

 

This is Aravind, - Recruitment Team from MSR Cosmos

We have an urgent requirement as follows:

Please respond with resumes in MS-Word Format with the following details to 
aravind@msrcosmos.com

Full Name :
Location :

Relocation :
Contact Number :
Email

Skype Id :

Last 4 digit SSNO :
Availability for project :

Availability for Interviews :
Visa Status and Validity :

D O B :

Years of Exp :

 

Requirement Details

 

Hadoop Admin – Architect

Remote

 

Position 1:
Start date:  4/19

Duration:  40 days

Lead engagement types: HDP Cluster Health Check / Tuning / Optimization

Migration HDP to CDP Private Cloud Base

Hbase, Phoenix skills are critical for this engagement.   

Scope:

Health Check Assessment for Two (2) HDP 3.1 Clusters (Estimated Duration: 2 weeks) Cloudera will work with Customer to perform health check on two (2) Hortonworks Data Platform (HDP) Customer environments including: ● Cluster hardware, OS, network and physical topology configurations ● Cluster and component designs, configurations including HDFS, HIVE, Spark, Phoenix and HBase etc. ● Data security components (Ranger, Sentry, Navigator, Atlas and Knox etc.) configurations ● YARN queue/resource utilization and NameNode resource usage ● Review Security implementation (platform and data) ● Known cluster issues, log analysis and available patches/hotfixes ● High availability, DR and backup / restore configurations ● Current cluster capacity and growth strategy for capacity planning keeping in view current workloads, and the planned workloads ● Provide findings and recommendations, in order of priority Scaling, Optimizing and Query Tuning Recommendations for One (1) HDP Cluster (Estimated Duration: 4 weeks) ● Prioritize the focus areas based on the outcomes of health check assessment above ● Review data ingestion architecture, and patterns for landing data into HBase ● Review current Phoenix and HBase workloads, as applicable ● Provide recommendations for scaling the workloads ● Provide architectural recommendations and sizing guidance ● Review specific queries that have performance issues ● Assist in root cause analysis for issues with Phoenix and HBase (query errors, slower queries) ● Tune and provide recommendations to optimize identified queries ● Mentor Customer team in Phoenix optimization, and provide best practices guidance ● Platform stabilization best practices guidance ● Provide documentation and knowledge transfer Cloudera Data Platform (CDP) Migration/Upgrade Planning for One (1) Cluster (Strategy, Solution, Estimate, Resources and Pre-Reqs) (Estimated Duration: 2 weeks) Cloudera will work with the Customer to review one environment for identifying the best possible path to get to CDP. Migration assessment includes the following activities at the high level: ● Completing the current state discovery questionnaire ● Complete upgrade risk assessment questionnaire ● Identify potential impacts to the current use cases ● Understanding customer's cloud strategy

 

 

Position 2:

Start date: 4/19

Duration: 10 days

Lead engagement type:   Migration HDP to CDP Private Cloud Base 

 

Scope: 

1.1 Current state analysis and review pre-requisites • Review existing use case implementation including code, data model, data types, storage, ingest, egress and visualization. • Review scope requirements including security and success criteria of the engagement and gain common understanding with Customer to determine which nodes will be ring-fenced for the new target CDP-PVCB 7 .1.x cluster( s). 1.2 Solution Design and Implementation Planning • Support Customer cluster and component design including security. • Support Customer in developing & documenting application code migration strategy & plan. • Build end to end implementation and testing plan. 1.3 CDP DC Cluster Installation and Security Cloudera Confidential T&M Statement of Work Page 1 of 10 v8.2 - 082019- DocuSign Envelope ID: 87968B6C-ED4A-456D-A3D3-32C81 ABEE589 Order No: CLD-202008-256309 CLOUD=RA • Decommission servers from the Preproduction and Production clusters to form the basis for a new CDP-PVCB 7 .1.x cluster. • Install and configure CDP-PVCB clusters using the decommissioned nodes. • Deploy security including Kerberos and AD integration for authentication. • Configure data governance including Ranger and Atlas for authorization and auditing. • Execute initial benchmarks and document tuning requirements. • Cluster performance baseline using standard scripts. • Validate cluster installation, support troubleshooting of issues, if any. • Provide high level overview of cluster and monitoring best practices. • Develop cluster operation run book documentation. 1.4 Application Migration Support • Assist in migrating necessary data sets from existing Customer environment to CDP DC Cluster. • Assist in migrating application workloads after necessary refactoring based in Cloudera best practices. • Develop scripts to move data from old source HDP 2.6.1 environment and the new CDP PVCB target. • Develop scripts to determine differences in the Hive Metastore between the source and target. • Mentor Customer team to operate cluster to support application in CDP environment.

 

Position 3:

Start date:  4/26

Duration: 30 days

Lead Engagement type:  CDH Cluster Health Check / Tuning / Optimization

Candidates must have strong Solr skills

Scope:

1.1 Phase 1: Cloudera will conduct a health check for two CDH v6.x Clusters namely STL and KC. A. Review and Assess Cluster & Component Configurations Cloudera will review Cluster and component configurations including cluster hardware, OS, number of Nodes, Node specifications, and network architecture. Cloudera will run standard performance check to detect bottlenecks and evaluate current scheduler configurations. Cloudera will recommend configuration changes required based on the specifics of cluster deployment, Customer requirements or benchmarking results. and suggest improvements. Below are the high-level activities: • Current use case implementations and ones planned for the future • Cluster hardware, OS, network and physical topology configurations • Cluster and component designs, configurations including HDFS, Solr, HIVE, Spark, Impala etc. • Data security components (Sentry, Navigator etc. ) configurations • YARN queue/resource utilization and NameNode resource usage • Known cluster issues, log analysis and available patches/hotfixes • High availability, DR and backup / restore configurations • Current cluster capacity and growth strategy for capacity planning B. Review and Assess Data lake Schema & Application Implementation Cloudera will evaluate Cloudera data lake and application implementation based on data volumes, data processing job, analytic processes, and data publishing SLAs. Below are the high-level activities: • Overall solution design and application architecture for use cases • Data lake schema, including data organization, data format & layout (partitioning, sorting, bucketing, etc.) • Data ingestion/ETL, data engineering, egress and analytics processes • Solr implementation reviews, analysis of any 3rd party or custom application integrations • Existing SDLC processes including CI/CD and automation • Known issues, challenges and Cloudera recommended practices C. Review and Assess Cluster Security & Data Governance Implementation Cloudera will review the current Cloudera cluster security implementation including data governance and perimeter security. Cloudera will recommend the configuration changes required based on the specifics of cluster deployment, use case implementation and customer security requirements. Below are the high-level activities: • Understand Customer's security and governance requirements • Security implementation including o Authentication with Kerberos and AD integration o Authorization with Ranger or Sentry o Auditing with Ranger plug ins • Current design and implementation of data security access policies • Current design and implementation of perimeter security using Knox including data exposed via Knox to external applications/APIs • Data encryption with both data in motion and data at rest • Data governance implementation including lineage and data tagging D. Review and Assess Cluster Operation Practices Cloudera will review current cluster operation practices including cluster management, cluster maintenance and cluster troubleshooting. This review will include an evaluation of cluster deployment and configuration management tools, monitoring practices, and alerting configuration. Below are the high-level activities: • Review day-to-day cluster operation processes including monitoring, alerting, log rotation etc • Enterprise Operations Integration related to escalation and access to knowledge base • Review team's daily operation activities, organization alignments and workload allocations • Assess operations skills of the team to develop a training/coaching and knowledge transfer plan • DR , backup and recovery plan, Capacity planning and patching/upgrade cadence • Understand current issues to help build troubleshooting best practices • Analysis of cross-team relationship, interaction processes and skills/readiness evaluation • Discuss best practices including multi-tenant data lake, high-volume/high-speed data ingestion, scalability, capacity planning, DR/Backup, cluster monitoring, management and automation E. Optimization Recommendations and Plan for Implementation Based on the assessment Cloudera will develop optimization recommendations and review with Customer in defining the implementation plan. Cloudera to create/update cluster Operational Runbook, conduct knowledge transfer and provide coaching to Customer team members. 1.2 Phase 2: Implementation of health check analysis recommendations A: Finalize the Implementation Plan Customer and Cloudera collectively review the recommendations from health check analysis and mutually agree in finalizing the implementation plan. Identify the tasks for Cloudera to implement in 3 weeks duration. B. Implementation of recommendations Cloudera to work with Customer in implementing the items mutually prioritized for the 3 weeks duration 1.3 Deliverables Phase 1 Deliverables: Health check analysis and recommendations document Phase 2 Deliverables: Implementation design and documentation  

 

 

 

Stay Safe

 

Thanks & Regards, 

Aravind
MSR COSMOS

6200 Stoneridge Mall Rd, Ste 300, Pleasanton, CA - 94588

Desk : 925 399 7145

Textnow : 732 574 5974

Fax : 925-219-0934

Email aravind@msrcosmos.com   
URLhttp://www.msrcosmos.com

https://media.licdn.com/mpr/mpr/shrink_200_200/AAMAAgDGAAoAAQAAAAAAAA75AAAAJGQ0MTUxMjliLWVhN2YtNDM1Zi05YzkxLTFhZWE1NjcyYTlkYQ.png

-          Microsoft Gold Partner

-          SAP Silver Partner

-          Oracle Gold Partner

-          Hortonworks Silver Partner

-          Cloudera Silver Partner

-          E-Verified

-          WBE Certified

 

 

Note: This email is not intended to be a solicitation.  Please accept our apologies and reply in the subject heading with REMOVE to be removed from our Mailing list.

 

Any information and documentations including Govt issued ID or government issued documents if forged, manipulated or Falsified is considered as a felony as per US laws Such actions are a punishable offense and can lead to Criminal Investigation or Indictment

 

Confidentiality Notice: 

Unless otherwise indicated, email transmission is not a secure form of communication and your reply may not be encrypted. The information contained in this message is proprietary and/or confidential. If you are not the intended recipient, please: (i) delete the message and all copies; (ii) do not disclose, distribute or use the message in any manner; and (iii) notify the sender immediately.

 

--
You received this message because you are subscribed to the Google Groups "Xrecnet IT Recruiters Network - Corp to Corp IT Jobs & Hotlists" group.
To unsubscribe from this group and stop receiving emails from it, send an email to xrecnet+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/xrecnet/CAMSx0H3gFeMG%3DDa79Rkyemsoh-st1wN5tVQyHr%3DbQGO_Xc840A%40mail.gmail.com.

No comments:

Post a Comment