Alexander George Consulting Services Limited is your strategic partner in talent acquisition and workforce solutions. With a deep understanding of the dynamic job market and a passion for connecting exceptional talent with outstanding opportunities, we are your go-to recruitment specialists, committed to helping organizations thrive and individuals succeed. As a leading Human Resources firm, we combine expertise, innovation, and personalized service to deliver exceptional results for our clients. We are catalysts for building high-performing teams and shaping successful careers, driving growth and excellence for organizations and individuals. We aim to provide exceptional recruitment solutions that connect top-tier talent with forward-thinking organizations, creating mutually beneficial partnerships that fuel success.
We are recruiting to fill the position below:
Job Title: Network and Infrastructure Engineer
Location: Remote
Job Type: Part-Time, Individual Contributor
Reports to: Lead Consultant
About the Role
We are seeking an experienced Network and Infrastructure Engineer with expertise in on-premises infrastructure, networking, and competitive analysis. In this role, you will set up, monitor, and maintain on-premises servers and network systems, using observability tools to ensure reliability and performance.
You will also conduct competitive analysis and research on digital products using observability tools, providing insights that inform our product strategies and competitive positioning.
This is a part-time, individual contributor role, working remotely, and reporting directly to the Lead Consultant.
Key Responsibilities
Set up, configure, and manage on-premises servers and network infrastructure, ensuring performance, security, and uptime.
Deploy and manage observability tools (e.g., Prometheus, Grafana, Splunk) tailored for on-premises setups, using them to monitor system health and improve visibility.
Proactively monitor server and network performance, troubleshooting any issues using tools like SolarWinds, Wireshark, tcpdump, and NetFlow to ensure optimal operation and stability.
Create automation scripts (Python, Bash, Ansible) to streamline infrastructure maintenance, observability integration, and network monitoring.
Build and maintain CI/CD pipelines for efficient deployment of observability tools across on-premises systems.
Use observability tools to assess the performance of both internal products and competitor offerings, identifying strengths, weaknesses, and trends in the marketplace.
Utilize observability tools to monitor and analyze the performance, reliability, and user experience of various digital products in the industry.
Your analysis will help us improve our offerings and stay competitive.
Set up monitoring, logging, and alert systems to ensure our products remain reliable and high-performing at all times.
Conduct performance tuning and capacity planning for on-premises servers, network devices, and observability tools to ensure reliability and scalability.
Maintain redundancy and failover strategies to minimize downtime and ensure continuous system uptime.
Develop and maintain documentation on network configurations, observability setups, and troubleshooting procedures.
Create runbooks to guide team members in diagnosing and resolving infrastructure issues efficiently.
Create Dashboards and Alerts. Build dashboards to give real-time insights into systems and product performance. Set up alerts and run books to help teams resolve issues quickly.
Provide Insights and Recommendations. Provide insights and recommendations based on your observations and analysis to guide strategic decisions in technology sales and product development.
Understand Stakeholder Needs. Collaborate with sales, development, and operations teams to understand product performance requirements and deliver observability solutions that support business goals.
Implement observability best practices, integrating monitoring and alerting tools directly with on-premises hardware and infrastructure.
Architect advanced network topologies, including Layer 2 and Layer 3 networks with redundancy, failover, and inter-VLAN routing to enhance system resilience.
Design and Manage Observability Platforms. Implement and maintain observability tools to monitor both cloud (AWS, Azure, GCP) and on-premises systems, ensuring that everything runs smoothly and any issues are quickly identified.
Required Skills & Experience
Proven experience as a Network and Infrastructure Engineer or similar role, with an emphasis on on-premises network management and observability integration.
Advanced knowledge of on-premises server management, including storage, compute, and networking.
Expertise in Layer 2 and Layer 3 network design, VLANs, VRFs, redundancy, and failover mechanisms.
Strong understanding of monitoring and alerting setups for local server and network performance.
Hands-on experience with observability tools like Prometheus, SolarWinds, Grafana, Splunk, Datadog, or New Relic.
Proficient in scripting (Python, Bash, Terraform, or Ansible) and automation.Familiarity with cloud platforms (AWS, Azure, or GCP) and on-premises infrastructure.
Experience with container orchestration tools like Kubernetes and understanding of microservices architecture.
Experience with CI/CD pipelines in a DevOps environment.Strong problem-solving and communication skills.
Preferred Qualifications:
Experience with AIOps (Artificial Intelligence for IT Operations) or machine learning-based observability tools.
Certifications in cloud platforms (AWS, Azure, or GCP).
Knowledge of performance optimization and capacity planning.
Background in performance optimization and capacity planning for on-premises environments.
Certifications related to networking or infrastructure management, such as CCNA, CompTIA Network+, or similar.
Benefits
Highly competitive salary
Flexible working hours and fully remote work.
Learning and development opportunities
Collaborative, innovative, and inclusive work environment.