General Dynamics Information Technology Senior Monitoring Engineer in Arlington, Virginia
Responsible for the design, development, deployment, and maintenance of infrastructure, application and business service monitoring, alerting, and reporting. The Senior Monitoring Engineer is engaged in all phases of the project lifecycle which include; gathering and analyzing user/business system requirements related to proactive and reactive monitoring and event creation, solution design and development, implementation, deployment of the user community, and operations and maintenance activities. Other duties include, but not limited to, operational activities related to Service Request and Incident and Problem Management. Proactively identifying gaps and creation of monitoring before an Incident occurs. Reactively responding to Incidents and remediating gaps where monitoring does not exist. Is responsible for improving processes, systems or products to enhance performance of the job area. Can adapt and embrace new perspectives and approaches on existing problems. Communicates with parties within and outside of own department, with the ability to educate others on the event management lifecycle. Works to influence parties within and outside of the job function at an operational level regarding policies, practices and procedures.
Technical Analysis Expertise: Understands how to gather and analyze data flows from various sources. Expected to lead and deliver complex network and service monitoring analysis, design strategies and solutions to meet operational requirements. Possess understanding of the building blocks, interactions, dependencies, and tools required to complete design work. Ability to translate complex technical issues in terms of business impact and business benefits.
Technical Leadership: Collaborates with technical teams and utilizes system expertise to deliver solutions. Continuously learns and teaches others existing and new technologies. Contributes to the development of others through mentoring or in house workshops. Influence technology and policy decisions made across the appropriate functional organization around architecture, design strategies and standards.
Technical Writing: Contributes clear documentation on multiple systems and services used. Able to document systems architecture, design strategies, standards, business requirements and technical interpretation. Develops and provides technical data for incorporation into internal presentations that may translate technical issues into business impact.
Responsible for the full lifecycle management of the monitoring applications to include maintaining the accreditation and security posture, implementing patches and upgrades, supporting solution design and projects, monitoring the health and status of the components supporting system availability and integrity.
Provide first and second level of support, analysis and trouble-shooting to resolve issues with event monitors. Offer expert tool advice to the troubleshooting team in quickly isolating and resolving incidents. Help drive reduction in Mean Time to Repair (MTTR) metric.
Think proactively and close gaps in current monitoring landscape to prevent incidents. Provide recommendations for improvement to reduce false-positives in alerts.
Provide written technical recommendations to improve monitoring capabilities.
Other duties as assigned.
- Bachelors Degree in Computer Science, Engineering, or a related technical discipline, or the equivalent combination of education, technical certifications or training, or work experience.
The successful candidate will have 14 plus years of relevant job experience in enterprise network and application monitoring as well as, but not limited to, the following skillsets:
Bachelor’s degree in Information Technology or equivalent
10 plus years’ experience engineering and operating enterprise monitoring platforms
Demonstrated proficiency as a LINUX/UNIX systems administrator
Demonstrated proficiency with coding and scripting to support process automations and fostering solid DevOps principles
Strong knowledge in Netcool Omnibus, Impact, ITNM, ITM and DASH
Knowledge and experience in architecting, design and implementation of IBM Enterprise Monitoring solutions
Excellent knowledge in Event Management process and should have experience in Integrating ITM, Netcool and ITNM
Strong analytical and problem-solving skills.
Proven experience and ability to manage problem resolutions of complex or intermittent issues in a multi-vendor, integrated enterprise environment
The ability to demonstrate depth of knowledge and skill in networking and network technologies
The ability to demonstrate concern and meet external and/or internal customers’ needs
Strong oral and written communications skills
Self-motivated and able to work well under pressure
Ability and desire to work cooperatively with others on a team
Active Secret clearance
Other desirable experiences: - BMC Remedy - IBM Application Performance Management - IBM Netcool/NOI - InfoVista - Riverbed - SolarWinds - Splunk - Packet analysis with sniffer or similar technologies
Offer contingent on certification verification and successfully completing customer on boarding.
For more than 50 years, General Dynamics Information Technology has served as a trusted provider of information technology, systems engineering, training and professional services to customers across federal, state, and local governments, and in the commercial sector. Over 40,000 GDIT professionals deliver enterprise solutions, manage mission-critical IT programs and provide mission support services worldwide. GDIT is an Equal Opportunity/Affirmative Action employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status, or any other protected class.
Number of Positions1
Job FunctionInformation Technology
Security Clearance LevelSecret
Full/Part TimeFull Time