Principal Software Engineer, HPC – New York Genome Center – New York, NY – Texas A&M Genomics & Bioinformatics Service

Job ID: 5498
Job date: 2016-05-02
End Date:

Country :

[Click Here to Access the Original Job Post]

Job Description:
About the New York Genome Center

The New York Genome Center (NYGC) is an independent, non-profit organization that leverages the collaborative resources of leading academic medical centers, research universities, and commercial organizations. Our vision is to transform medical research and clinical care in New York and beyond through the creation of one of the largest genomics facilities in North America, integrating sequencing, bioinformatics, and data management, as well as performing cutting-edge genomics research.

The New York Genome Center is a VEVRAA Federal Contractor. All qualified applicants will receive consideration for employment and will not be discriminated against on the basis of race, color, religion, sex, sexual orientation, national origin, age, disability, or protected veteran status. The New York Genome Center takes affirmative action in support of its policy to and advance in employment individuals who are minorities, women, protected veterans, and individuals with disabilities.

Put your software engineering skills in massively parallel processing and high-throughput workflow management to use in designing and building the analysis workflow infrastructure for genomic analysis. The system must support thousands of hours of processing per terabyte, and tens of terabytes per day. No prior experience in genome biology is required. This is an opportunity to learn bioinformatics, and participate in cutting-edge research to apply genomic discoveries to the clinical care of patients with diseases such as cancer, ALS, Alzheimer’s, and inflammatory diseases.

The incumbent will support our rapidly expanding informatics division in developing high-throughput pipelines and large-scale data management systems. The ideal candidate will be a pragmatic team player with experience in high-performance computing and management of “big data”.

Job duties will include:
Collaborate with bioinformatics scientists, users, and stakeholders to define requirements and specifications for data processing pipelines and analysis tools;

Prototype, develop, deploy, document, and support production-quality, fault-tolerant, high-performance software;

Convert prototype algorithms developed by computational biologists into robust, optimized code suitable to run in automated pipelines;

Support customization and integration of local software, including our Lab Information Management System (LIMS) and integration with other Enterprise software;

Port computational methods and pipelines to cloud environments, and develop protocols for cloud-bursting;

Develop expertise in the scientific and computational resources of the center.

Required Skills

BS in Computer Science, computer engineering, or related field plus 10 years related experience; master’s degree plus 8 years related experience, or equivalent combination of education and experience;

Demonstrated ability to design and build large-scale software systems, and to produce readable, documented code;

Ability to work independently to deliver quality code under tight deadlines;

Proficiency in Java, Scala, or similar languages;

Experience with large databases: relational (e.g., mySQL, PostgreSQL), column-oriented databases, array databases, or NoSQL databases (Accumulo, Couchbase, etc);

Experience with version control and source code management systems (e.g.,GIT);

Experience with applications that do massively parallel processing of large amounts of data in 24/7 automated environments;

Experience building applications that are fault tolerant and can handle edge conditions and errors without wasting compute cycles;

Familiarity with batch scheduled high throughput computing using queuing systems (e.g., SGE or LSF) helpful;

Experience with agile programming techniques;

Experience with scientific applications, and genomics in particular, is a plus. Interest in learning about genetics and biomedicine is essential;

Excellent communication skills and proven ability to work directly with customers to understand their needs and translate those into actionable software requirements;

Ability to make decisions with incomplete information and produce results in a timely fashion;

Must be able to work in a fast-paced, start-up like environment.

Required Experience

Requeriments :

Skills :

Areas :

Bioinformatics

Additional Info:

[Click Here to Access the Original Job Post]