HPC/AI Systems Engineer - AI4S
Responder al anuncioPosition @ Barcelona Supercomputing Center (BSC), Spain
We are looking for candidates with a technical background who will become part of the Operations Department of the Centre.
The funding for these actions/fellowships and contracts comes from the European Union Recovery and Resilience Facility - Next Generation, within the framework of the General Invitation by the public business entity Red. es to participate in the talent attraction and retention programs within Investment 4 of Component 19 of the Recovery, Transformation, and Resilience Plan. For more information, please check: here.
Key Duties
- Installation, maintenance, update and resolution of issues related to IT services of the centre (mail, web, databases, servers, etc. )
- Configuration and administration of the different storage subsystems and backup system.
- Configuration and administration of the BSC HPC supercomputing resources.
- Configuration and administration of BSC cloud platforms (OpenStack, OpenNebula and ovirt).
- Configuration and administration of BSC AI platforms.
Requirements
Education
- Degree/Master's degree in Computer Sciences or similar field.
Essential Knowledge and Professional Experience
- Knowledge and experience in system administration of HPC Linux platforms (4 years minimum)
- Knowledge and experience in system administration of distributed file systems like GPFS (IBM Storage Scale) or Lustre
- Knowledge and experience in system administration of cloud platforms like OpenStack/OpenNebula
Additional Knowledge and Professional Experience
- Experience with tools like Kubernetes, Docker Swarm, or Apache Mesos for container orchestration and resource management
- Experience with GPU clusters, including tools like Nvidia Docker, CUDA, cuDNN, and managing NVIDIA GPUs in a clustered environment
- Knowledge of AI/ML frameworks like TensorFlow, PyTorch, Nvidia Megatron
- Understanding of its deployment and management in a cluster environment
- Familiarity with Docker and managing AI/ML containers with Docker
- Knowledge of object storage systems like Amazon S3, MinIO, or similar technologies
Competences
- Initiative, responsibility and good organizational skills
- Analytical problem-solving skills
- Availability to travel and assist with project events/workshops
Conditions
The position will be located at BSC within the Operations Department. We offer a full-time contract (37. 5h/week), a good working environment, a highly stimulating environment with state-of-the-art infrastructure, flexible working hours, extensive training plan, restaurant tickets, private health insurance. Duration: 4 years. Holidays: 23 paid vacation days plus 24th and 31st of December per our collective agreement. Salary: 50, 000. 00€. Additional Expenses Grant: Each fellowship will be associated with a grant for additional expenses, such as IT equipment, travel, training, stays, etc. Starting date: asap - the incorporation for this vacancy must be before the 16th of December 2024.
#J-18808-Ljbffr¡Sea el primero en responder a este anuncio de trabajo!
-
¿Por qué está buscando trabajo en Trabajas.es?
Crear alerta de empleo
Cada día nuevos anuncios de trabajo Puede elegir entre una amplia gama de trabajos: nuestro objetivo es ofrecer una selección lo más amplia posible Déjenos enviar nuevos anuncios por correo electrónico Sea el primero en responder a las nuevas ofertas de empleo Todos los anuncios de trabajos en un único lugar (de empleadores, agencias y otros portales) Todos los servicios para demandantes de empleo son gratuitos Le ayudaremos a encontrar un nuevo empleo