This item is deleted.

Junior Data Extraction Developer for University of Cambridge

datlab at 2016-01-19 17:22:38

We are looking for new enthusiastic guys and girls to join us on a research project called DIGIWHIST for University of Cambridge. You will be part of our Datlab team situated in Prague, but employed directly by the University of Cambridge. How does that sound to you? :)

DIGIWHIST is a cutting edge Big data project (no buzzwords, it’s really BIG), aiming to gather data on public procurement from the entire Europe. Our team will be responsible for the technological core of the whole project. Among the biggest challenges are extraction of data from various Web environments, its processing and cleaning. Further on, we will publish results as open data and develop indicators to detect corruption and malpractice.

Your tasks will but are not limited to:

  • Automated data collection from official government webpages including text extraction from structured/unstructured data;

  • Building a large scale data processing and management system capable of data cleaning and linking public procurement databases to other databases such as company registration records; and

  • Designing efficient data use interface supporting the use of the collected data by third parties.

Interested? Let's see if you can get through our requirements:

  • Master’s-level degree or comparable work experience

  • Solid computing background, knowledge of object oriented programming principles and design patterns

  • Orientation in database systems - SQL (PostgreSQL), NoSQL (advantage, but not needed)

  • Working knowledge of Java programming language, test-driven development and data analytics are desirable

Job details:
Net wage 1550 - 1750 EUR/month (work as freelancer - OSVČ)
Fulltime (partial home-office is OK)
Location Prague 6 (Hradčanská)

https://www.startupjobs.cz/nabidka/4988 ... -developer
contact: kontakt@datlab.cz