Guias

Is web scraping legal? 🫢😳








🔗 Follow me on LinkedIn 👉
🆇 OR on X/Twitter 👉

Courses for Data Nerds
==================================
📜 Google Data Analytics Certificate (START HERE) 👉🏼
💿 SQL for Data Science 👉🏼
🧾 Excel Skills for Business 👉🏼 
🐍 Python for Everybody 👉🏼
📊 Data Visualization with Tableau 👉🏼 
🏴‍☠️ Data Science: Foundations using R 👉🏼
➕ Coursera Plus Subscription (7-day free trial) 👉🏼
👨🏼‍🏫 All courses 👉🏼

Build a Portfolio
==================================
👩🏻‍💻Build portfolio here 👉🏼
Rebate Code: “LUKE”
My Portfolio 👉🏼

Books for Data Nerds
==================================
📚 Books I’ve read 👉🏼
📗 Data Analyst Must Read 👉🏼
📙 Tableau 👉🏼
📘 Power BI👉🏼
📕 Python 👉🏼

Tech for Data Nerds
==================================
⚙️ Tech I use 👉🏼
🪟Windows on a Mac (Parallels VM) 👉🏼
👨🏼‍💻 M1 Macbook Air (Mac of choice) 👉🏼
💻 Dell XPS 13 (PC of choice) 👉🏼
💻 Asus Vivo Book (Lowest Cost PC) 👉🏼
💻Lenovo IdeaPad (Best Value PC)👉🏼

Social Media / Contact Me
======================
🙋🏼‍♂️Newsletter:
🌄 Instagram:
⏰ TikTok:
📘 Facebook:
📥 Business Inquiries: luke@lukebarousse.com

As a member of the Amazon, Coursera, Hostinger, and Parallels Affiliate Programs, I earn a commission from qualifying purchases on the links above. It costs you nothing but helps me with content creation.

#dataanalyst #datascience

Link do Vídeo






30 Comentários

  1. IDK if the bot you program have some sort of rate limiting or like a delay of 1sec between each request!!

  2. I don't understand why this is illegal or why anyone would even care. What's wrong with collecting data efficiently?

  3. i think you are just scraping too fast. or collecting too much data from one ip, likely both. its like how there is a limit of 70 connections per day on linked in you just need to stay within the amount of data they allow you

  4. They have anti scraping measures now too. I mean the site basically useless if you dont scrape it because the search is literally dogwater and i found it was the only way to actually filter the results to get actually relevant jobs

  5. So…just state it isn't illegal (in state law)

    I think all these companies need to grow up and realize they are sending us paper catalogues with webpages. When we get the page we can do the fuck we want with it (privately)

  6. Go through a public dataset manually
    LinkedIn: 😄
    Go through a public dataset with a bot
    LinkedIn: 😠

  7. Me my question is : how did you do this web scraping stuff? I mean just show where to find the place to learn. I will dedicate 24h straight of my life to learn. I will be very happy. As a data neird, am going crazy of all of the flashy stuff on internet but with no value. Help me.

  8. A few years ago I scraped data that was in the public domain, from websites around the world. I never had a problem with accessing the web pages. The problem was that the webpages changed. You had to constantly rewrite the scraping code, or change inputs to scraping tools. It might have cost less and reduced a lot of stress. Just by hiring low cost labor to manually input the data.

  9. I don't get it … Its no different from a real person sitting there and copy pasting things all day. Or they want you to do it manually so you can suffer …

  10. I already knew that thats why never tried with LinkedIn.
    There are Github projects for that as well but doesn’t come with warranty.

  11. Data viewed by the public on the internet via a privately owned corporate site does not necessarily equal public data.

  12. This was actually a project idea that I had for quite some time, to see job distribution in different states/countries, cross relate to salary by company from GlassDoor and all that, while researching, I discovered that there is an informal LinkedIn API, so you don’t actually need to scrape all the data, quite helpful

    There are a bunch of articles on Medium about it too

  13. I make a weather API. But now it give me an error like you have been blocked because we have registered an unusual ammount of traffic from your IP address.

    So I can't finish my project because of this. How can I solve this issue

Comentários estão fechados.