Build reliable and efficient pipelines that transform and transport data into formats that data scientists can use for analysis; Gather and document data management and analysis requirements; Develop and maintain ETL pipelines in Python (e.g. pulling data from a REST API or directly from cloud-based databases); Design, create, deploy and manage the logical and physical data architecture; Create and maintain data dictionaries and data mappings; Ensure that the architecture supports the requirements of data scientists and other stakeholders
University degree (Master’s or equivalent) in a technical discipline; 5+ years of experience in a similar data management role; MS SQL Server, T-SQL experience and PostgreSQL; Python or other data transformation tools; Experience with Docker containers; Data/information modeling expertise at the enterprise level, working with large and complex data models; Strong SQL and ETL skills; Experience optimizing physical designs for effective ETL and maximum query performance
Database Administrator/Architect/Developer Center for Biosustainability Share on Facebook Share on Twitter Share on Linkedin Monday 13 Sep 21 Apply for this job Apply no later than 30 September 2021 Apply for the job at Center for Biosustainability by completing the following form. Apply online Are you a seasoned database administrator, architect, or developer and interested in extending your skills in the emerging field of graph databases? - then you might need to look closer here At the Center for Biosustainability, where we engineer commercialization-ready cell factories for the production of biochemicals and therapeutics, we need near-real-time data analytics as we iterate through design-build-test-learn engineering cycles. We are looking for an experienced database administrator/architect/developer who can maintain a centralized data management infrastructure, that will allow us to rapidly integrate data that is needed for the diagnosis and optimization of cell factories and allow project teams to collaborate to make the best possible engineering decisions. Data may include experimental data from a cloud-based Laboratory Information Management System, multi-omic data (such as genomics, transcriptomics, proteomics, metabolomics, fluxomics), and unstructured project information. Responsibilities include Build reliable and efficient pipelines that transform and transport data into formats that data scientists can use for analysis. Gather and document data management and analysis requirements. Develop and maintain ETL pipelines in Python (e.g. pulling data from a REST API or directly from cloud-based databases) Design, create, deploy and manage the logical and physical data architecture Create and maintain data dictionaries and data mappings. Ensure that the architecture supports the requirements of data scientists and other stakeholders. Create data repositories, marts and develop reporting solutions as needed. Conduct research in new tools and technologies that shape the future of data analytics. Qualifications & experience MS SQL Server, T-SQL experience and PostgreSQL. Python or other data transformation tools. Experience with Docker containers. Data/information modeling expertise at the enterprise level, working with large and complex data models Strong SQL and ETL skills. Experience optimizing physical designs for effective ETL and maximum query performance. Must have prior experience leading successful data warehousing implementations. Experience with a graph database, specifically Neo4J is a big plus. Experience with managing microbial genome sequences and omics data is highly desired. Excellent written and verbal communication skills, including documentation. Experience with MS Azure is a plus. Education & special training University degree (Master’s or equivalent) in a technical discipline. 5 years of experience in a similar data management role. Salary and terms of employment The appointment will be based on the collective agreement with the Danish Confederation of Professional Associations. The allowance will be agreed upon with the relevant union. This employment is limited to 31.12.2025 Application and contact Please submit your online application no later than 30 September 2021 . Applications must be submitted in English as one PDF file containing all materials to be considered CV, cover letter, diploma and if relevant list of publications. If you have any questions, you are very welcome to contact Vincenzo Capuano, Director of Software Engineering at email@example.com All interested candidates irrespective of age, gender, race, disability, religion or ethnic background are encouraged to apply. The Novo Nordisk Foundation Center for Biosustainability (DTU Biosustain) Recent progress in our ability to read and write genomic code, combined with advances in automation, analytics and data science, has fundamentally changed the scope and ambition of harnessing the potential of biological systems. Big data approaches and analysis of biological systems are key research instruments at the Center. DTU Biosustain utilizes these advances for microbial cell factory design to foster sustainable lifestyles in relation to three application areas: Sustainable Chemicals, Natural Products, and Microbial Foods. Learn more at biosustain.dtu.dk Technology for people DTU develops technology for people. With our international elite research and study programmes, we are helping to create a better world and to solve the global challenges formulated in the UN’s 17 Sustainable Development Goals. Hans Christian Ørsted founded DTU in 1829 with a clear vision to develop and create value using science and engineering to benefit society. That vision lives on today. DTU has 12,900 students and 6,000 employees. We work in an international atmosphere and have an inclusive, evolving, and informal working environment. DTU has campuses in all parts of Denmark and in Greenland, and we collaborate with the best universities around the world.
Please click here, if the job didn't load correctly.
Please wait. You are being redirected to the job in 3 seconds.