Title: Principal Technical Duty Officer. (Also called: NOC Lead; Crisis Mgmt Lead; Availability Lead; Site Reliability Lead.)
Location: Onsite at client’s location in San Jose, CA
Summary: Looking for a commander who directs crisis resources in triaging and solving site issues. Must be technically savvy as well as having excellent communication and mgmt skills.
This is a 3 month contract (1099) to hire (w2) with a high salary upon conversion. We must have your resume by 8am PST Monday 7/12/10 to be considered for this position. Email resume to: fs010@vettanna.com
Primary Job Responsibilities:
Our well known client is looking for a Principal Technical Duty Officer with a passion for solving complex and interesting problems. The highly motivated individual needs to be an independent and initiative seeking individual.
- Candidate needs to manage client’s services very effectively to support high-volume and business critical applications.
- Candidate would need to work in a shift model and handle multiple customer impacting issues on the client’s Site.
- Candidate should be a leader on complex site incidents and drive issues to restoration.
- Proactive efforts to prevent site incidents from occuring.
- Independently implement and build tools and test major features and capabilities, as well as work jointly with other team members on complex features and complex site issues.
- Provide technical leadership and do technical hands on scripting, tooling, automation for continuous improvement and site restoration.
- Solid Unix systems administration and network adminstration experience is a must.
- Work with Architecture,Engineering,and Operations teams to develop innovative solutions to attain high availability scalability and reliability.
- Apply technical & domain expertise to solving day to day challenges.
Job Requirements:
- Excellent analysis, design, and problem-solving skills.
- Willing to work both day and night shifts.
- Proven crisis management leadership ability working with cross functional teams and executive management
- Effective communication skills are a must.
- Candidate should have excellent crisis management and incident management skills.
- Candidate should be very effective in working under stressful situations when the systems and components are in an unstable state.
- Good experience building highly-automated infrastructure.
- Proven results-oriented engineer in a high-velocity and dynamic environment is a must.
- Experience with Virtualization, open source software is a plus.
- Advanced knowledge of operating system internals, file system structures, and machine architectures in a UNIX/Linux environment.
- Excellent knowledge of Networking, Unix, Windows, Load balancers, Mail, DNS, TCP/IP and SAN technologies.
- Advanced level scripting is a plus.
- Working knowledge of relational databases (Oracle preferred), Apache, HTTP/HTML, XML, XSLT is a plus.
- Demonstrated leadership in technologies that are core to large and distributed systems like cloud computing, web services, multi-tier serving architectures etc.
Basic Qualifications:
Bachelors Degree or Equivalent. Computer Science Degree is preferrable.