Senior Python Developer (web)

We are looking for a skilled Web Scraping Developer to join our team. You will design and implement web scraping solutions that collect data from various websites. The role requires a strong understanding of web technologies and of data extraction and manipulation techniques. You should be proficient in Python and have experience with popular web scraping frameworks and tools.

Your Responsibilities Will Include:

- Design and implement web scraping solutions to collect data from diverse sources.

- Develop and maintain web scrapers using Python and relevant libraries.

- Handle large volumes of data and ensure data accuracy and integrity.

- Utilize proxy services and headless browsers to bypass IP blocks and CAPTCHAs.

- Work with databases to store and manage scraped data.

- Ensure compliance with website terms of service and legal guidelines.

- Collaborate with the data engineering team to integrate scraped data into the data pipeline.

Our Technical Requirements:

* Programming Language:

  - Proficiency in Python.

* Libraries and Frameworks:

  - Experience with BeautifulSoup for HTML and XML parsing.

  - Experience with Scrapy for complex and large-scale scraping projects.

  - Proficiency in Selenium for browser automation and interaction with JavaScript-heavy pages.

  - Experience with Requests for sending HTTP requests.

* Databases:

  - Knowledge of SQLite for small projects.

  - Experience with PostgreSQL for larger projects.

  - Familiarity with MongoDB for storing unstructured data.

* Other Tools and Technologies:

  - Experience with Docker for containerizing applications.

  - Understanding of proxy services to bypass IP blocks and restrictions.

  - Familiarity with Headless Chrome for automating interactions with websites without a graphical interface.

* CAPTCHA Bypassing:

  - Experience with 2Captcha or Anti-Captcha services for solving CAPTCHAs.

  - Familiarity with Puppeteer for automating CAPTCHA interactions.

  - Knowledge of Tesseract OCR for recognizing text-based CAPTCHAs.

* Additional Tools:

  - Proficiency in using Regular Expressions (Regex) for data extraction from text.

To work efficiently and comfortably in this role, you should have:

- Proven experience in developing web scrapers and data extraction solutions.

- Strong problem-solving skills and the ability to handle complex scraping challenges.

- Experience with handling large datasets and ensuring data accuracy.

- Knowledge of web technologies, the HTTP protocol, and web scraping best practices.

- Familiarity with legal and ethical considerations in web scraping.

Education:

A degree in Computer Science, Information Technology, or a related field is preferred, but not mandatory if you have relevant experience and skills.

How to Apply:

Interested candidates should submit a resume and a cover letter detailing their experience with web scraping projects, along with any relevant code samples or a portfolio.

Join us and be part of a dynamic team where your expertise will contribute to our data-driven success!
