IT Infrastructure Specialist

Job type: Full-time
Experience: 3-5 years
Salary: Not mentioned
Location: Egypt

Job Details

Experience Needed: 3 to 7 years
Career Level: Experienced (Non-Manager)
Education Level: Not Specified
Gender: Males preferred
Salary: Confidential

Job Description

Al Farouk Educational Services is seeking an experienced **IT Infrastructure Specialist** to design, implement, and maintain comprehensive IT infrastructure supporting our educational technology platforms. This role requires advanced technical expertise in server management, database administration, Python automation, and scalable platform architecture to support large-scale e-learning operations and student account management systems.

The ideal candidate will have deep experience managing multi-user systems at scale, optimizing high-transaction databases, developing automation scripts, and implementing infrastructure monitoring to ensure high availability and performance.

Infrastructure & System Management

- Design, implement, and maintain IT infrastructure for educational technology platforms supporting large-scale operations
- Manage high-performance Linux/Ubuntu server infrastructure for databases, application servers, and content delivery
- Monitor system performance, conduct capacity planning, and optimize infrastructure for scalability
- Coordinate infrastructure upgrades, system migrations, and technology refresh cycles with minimal service disruption


Database Administration & Optimization

- Design and manage PostgreSQL database infrastructure for student account management, learning analytics, and content repositories
- Implement database optimization strategies including indexing, query tuning, and performance monitoring for high-transaction workloads
- Develop automated database maintenance procedures, backup strategies, and disaster recovery protocols
- Create complex SQL queries, stored procedures, and database reports for educational data management

Automation & Development

- Develop Python automation scripts for infrastructure management, user account provisioning, and system maintenance
- Implement automated monitoring and alerting systems for proactive incident detection and resolution
- Create CLI tools and RESTful APIs using Python frameworks (FastAPI/Flask) for system administration and operational efficiency
- Build scheduled automation jobs (cron) for routine maintenance, data processing, and system health checks

Platform Integration

- Develop and maintain RESTful APIs for educational platform integration and third-party service connectivity
- Integrate third-party educational technology services and APIs for enhanced platform functionality
- Implement API authentication, rate limiting, and security best practices for platform access control
- Create comprehensive API documentation and integration guides for internal teams

Network & Security

- Manage network architecture, security protocols, and access control systems for educational technology infrastructure
- Implement VPN configurations, firewall rules (iptables), and network security best practices
- Ensure compliance with data protection requirements and educational technology security standards
- Conduct regular security audits and vulnerability assessments for infrastructure components

Monitoring & Performance

- Deploy and configure monitoring tools (Prometheus, Grafana, or similar) for real-time infrastructure visibility
- Create performance dashboards, alerting systems, and operational metrics tracking
- Analyze performance, identify bottlenecks, and provide optimization recommendations
- Implement centralized logging and audit systems for compliance and troubleshooting

Documentation & Support

- Develop and maintain comprehensive technical documentation for infrastructure systems, APIs, and procedures
- Create runbooks, standard operating procedures (SOPs), and disaster recovery documentation
- Provide technical support and troubleshooting for infrastructure-related issues
- Train the operations team on system usage, best practices, and troubleshooting procedures

Job Requirements

Education & Experience

- Bachelor's degree in Information Technology, Computer Science, Computer Engineering, or a related field
- 3-5 years of hands-on IT infrastructure experience in technology companies, digital services, or educational technology sectors
- Proven experience managing large-scale multi-user systems with thousands of concurrent user accounts
- Experience with high-transaction database systems supporting complex user management and data processing requirements

Core Technical Skills (Must-Have)

- ✅ Advanced Python programming with strong experience in automation, scripting, and API integration
- ✅ PostgreSQL database design, optimization, and administration for production environments
- ✅ Linux system administration (Ubuntu/Debian preferred) including bash scripting, service management, and troubleshooting
- ✅ API development using Python frameworks (FastAPI, Flask, or Django) for backend services
- ✅ Infrastructure monitoring tools and practices for production system reliability
- ✅ Network fundamentals including TCP/IP, DNS, VPN, proxy configuration, and load balancing

Additional Technical Skills (Preferred)

- Experience with browser automation frameworks (Selenium, Playwright) for testing and QA automation
- Knowledge of multi-account management systems, user provisioning, and identity management platforms
- Familiarity with proxy servers, content delivery networks (CDN), and distributed systems architecture
- Understanding of Infrastructure as Code (Terraform, Ansible) and DevOps practices
- Experience with Redis, Nginx, and high-performance web architectures
- Knowledge of security best practices including encryption, secrets management (HashiCorp Vault), and access control
- Familiarity with Docker, Kubernetes, and containerization technologies
- Experience with cloud platforms (AWS, Azure, or GCP) for hybrid infrastructure

Soft Skills & Personal Attributes

- Strong analytical and problem-solving abilities for complex technical challenges
- Ability to work under pressure and respond quickly to critical infrastructure incidents
- Excellent written and verbal communication skills in English (Arabic is a plus)
- Detail-oriented approach to system configuration, monitoring, and troubleshooting
- Proactive mindset with ability to anticipate issues and implement preventive measures
- Self-motivated with ability to work independently and manage multiple priorities
- Team player with experience collaborating across technical and non-technical teams

Key Performance Indicators (KPIs)

Uptime & Reliability

- Maintain 99.5%+ uptime for all critical infrastructure and database systems
- Achieve <10 minutes mean time to detection (MTTD) for infrastructure failures
- Maintain <30 minutes mean time to recovery (MTTR) for critical system incidents

Performance Optimization

- Achieve <2 seconds response time for 95% of database queries and API requests
- Maintain <100 ms latency for critical user-facing operations
- Support concurrent user loads without performance degradation during peak hours

Automation & Efficiency

- Automate 80%+ of routine maintenance and operational tasks
- Reduce manual intervention time to <1 hour per day for infrastructure management
- Implement automated monitoring and alerting for 100% of critical systems

Security & Compliance

- Maintain 100% compliance with data protection requirements and security standards
- Complete quarterly security audits with all high/critical findings remediated within 30 days
- Zero security incidents related to infrastructure vulnerabilities or misconfigurations

Documentation & Support

- Achieve 90%+ satisfaction scores for infrastructure reliability and support (quarterly survey)
- Maintain comprehensive documentation for all critical systems (100% coverage)
- Complete all scheduled maintenance within planned windows with zero service disruption


