Company Description
Job Description
Engineer / Senior Engineer / Principal Engineer - Web scraping, Python - Pune, Chennai, Vadodara India
REF29681T
"The very rapid development of e-commerce today gives access to thousands of information. These data are difficult to exploit by companies, which then have difficulty choosing the right levers of action and measuring their impact.This is where Data Impact by NielsenIQ comes in! Every day, we collect more than 60 billion pieces of information, process them and use them in innovative monitoring and action tools for professionals in the sector. Our goal: to give our customers and consumers real-time visibility into the market. Today: Data Impact by NielsenIQ is a leading start-up in the ‘Retail Analytics’ sector.
About the Job
In full growth, particularly internationally, we are looking for new collaborators to join our fabulous team! A young but experienced, dynamic and complementary team: a resolutely start-up spirit! Real job and career opportunities A friendly atmosphere and a climate of trust that promotes autonomy and challenge!"
Responsibilities:
- Responsible for the capture of massive data on the web and mobile terminals, and the design of architectures such as extraction, deduplication, classification, clustering, and filtering;
- Responsible for the design and development of distributed web crawlers, able to independently solve various problems encountered in the actual development process;
- Responsible for the research and development of web page information extraction technology algorithms to improve the efficiency and quality of data capture;
- Responsible for the analysis and warehousing of crawled data, monitoring of the crawler system and abnormal alarms;
- Responsible for designing and developing data collection strategies and anti-shielding rules to improve the efficiency and quality of data collection;
- Responsible for the design and development of core algorithms according to the system data processing flow and business function requirements;
Qualifications
Must Haves:
- Proficient in Python language, familiar with one or more of the commonly used crawler frameworks, such as Scrapy framework or other Web scraping frameworks, with independent development experience.
- Have 1-15 years of experience
- Familiar with vertical search crawlers and distributed web crawlers, deeply understand the principles of web crawlers, have rich experience in data crawling, parsing, cleaning, and storage related projects, and master anti-crawler technology and breakthrough solutions.
- Master the basic operation of linux,
- Experience in distributed crawler architecture design, IP farms and proxy is preferred.
- A solid foundation in data structure and algorithms is preferred.
Good to have:
- Familiar with common data storage and various data processing technologies are preferred.
- Familiar with commonly used frameworks such as ssh, multi-threading, network communication programming related knowledge.
- Familiar with at least one RDBMS and non-structure DB technologies.
- Hands-on experience for crawling any eCommerce platform is a big plus.
Additional Information
- Enjoy a flexible and rewarding work environment with peer-to-peer recognition platforms.
- Recharge and revitalize with help of wellness plans made for you and your family.
- Plan your future with financial wellness tools.
- Stay relevant and upskill yourself with career development opportunities.
Our Benefits
- Flexible working environment
- Volunteer time off
- LinkedIn Learning
- Employee-Assistance-Program (EAP)
About NIQ
NIQ is the world’s leading consumer intelligence company, delivering the most complete understanding of consumer buying behavior and revealing new pathways to growth. In 2023, NIQ combined with GfK, bringing together the two industry leaders with unparalleled global reach. With a holistic retail read and the most comprehensive consumer insights—delivered with advanced analytics through state-of-the-art platforms—NIQ delivers the Full View™. NIQ is an Advent International portfolio company with operations in 100+ markets, covering more than 90% of the world’s population.
For more information, visit NIQ.com
Want to keep up with our latest updates?
Follow us on: LinkedIn | Instagram | Twitter | Facebook
Our commitment to Diversity, Equity, and Inclusion
NIQ is committed to reflecting the diversity of the clients, communities, and markets we measure within our own workforce. We exist to count everyone and are on a mission to systematically embed inclusion and diversity into all aspects of our workforce, measurement, and products. We enthusiastically invite candidates who share that mission to join us. We are proud to be an Equal Opportunity/Affirmative Action-Employer, making decisions without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability status, age, marital status, protected veteran status or any other protected class. Our global non-discrimination policy covers these protected classes in every market in which we do business worldwide. Learn more about how we are driving diversity and inclusion in everything we do by visiting the NIQ News Center: https://nielseniq.com/global/en/news-center/diversity-inclusion
Other Jobs from NielsenIQ
Data Engineer - SAP Business Objects
Technology Support Engineer
Customer Configuration Project Manager
Similar Jobs
Senior Data Scientist
Site Reliability Engineer
Software Engineer 2
Product Security Engineering Manager
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 401 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say