Skip to content
0
  • Home
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
  • Home
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Sketchy)
  • No Skin
Collapse

Wandering Adventure Party

  1. Home
  2. Uncategorized
  3. https://OpenStreetMap.org has been disrupted today.

https://OpenStreetMap.org has been disrupted today.

Scheduled Pinned Locked Moved Uncategorized
openstreetmapddosscrapers
42 Posts 28 Posters 7 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • George E. πŸ‡ΊπŸ‡Έβ™₯πŸ‡ΊπŸ‡¦πŸ‡΅πŸ‡ΈπŸ³οΈβ€πŸŒˆπŸ³οΈβ€βš§οΈG George E. πŸ‡ΊπŸ‡Έβ™₯πŸ‡ΊπŸ‡¦πŸ‡΅πŸ‡ΈπŸ³οΈβ€πŸŒˆπŸ³οΈβ€βš§οΈ

    @osm_tech@en.osm.town
    Time to deploy Anubis in front of all the URLs that the scrapers are hitting.

    OpenStreetMap Ops TeamO This user is from outside of this forum
    OpenStreetMap Ops TeamO This user is from outside of this forum
    OpenStreetMap Ops Team
    wrote last edited by
    #20

    @gme Anubis is great, but not ideal for us. We have reasonable technical mitigations in place, but it is unfortunately an arms race (and time/resource blackhole)

    1 Reply Last reply
    0
    • OpenStreetMap Ops TeamO OpenStreetMap Ops Team

      @Don_Clemente Just dumb scrapers. They want our data, so maybe we /should/ redirect them to the latest full history planet file, it is only 245GB πŸ˜‰

      Don Clemente πŸ”œ39C3 ☎️2470D This user is from outside of this forum
      Don Clemente πŸ”œ39C3 ☎️2470D This user is from outside of this forum
      Don Clemente πŸ”œ39C3 ☎️2470
      wrote last edited by
      #21

      @osm_tech yes, but that's heavy on your resources and bandwidth. A zip file of 10GB Zeros is much more resource-efficient on your side πŸ˜‰

      The DoctorD 1 Reply Last reply
      0
      • Petri SalmelaP Petri Salmela

        @ISibboI @osm_tech I think, that spreading it on 100 000+ IP addresses tells they know they are doing shady thing.

        G This user is from outside of this forum
        G This user is from outside of this forum
        Gerard Thornley
        wrote last edited by
        #22

        @pesasa @ISibboI @osm_tech It's a pattern I've seen others relate to the building of AI datasets.

        G 1 Reply Last reply
        0
        • G Gerard Thornley

          @pesasa @ISibboI @osm_tech It's a pattern I've seen others relate to the building of AI datasets.

          G This user is from outside of this forum
          G This user is from outside of this forum
          Gerard Thornley
          wrote last edited by
          #23

          @pesasa @ISibboI @osm_tech
          If its helpful (if you want to stuff something unpleasant down their throats), I believe @jwz has been making a 'json bomb' for (AIUI) pretty much this purpose:
          https://mastodon.social/@jwz/116049057703097965

          1 Reply Last reply
          0
          • OpenStreetMap Ops TeamO OpenStreetMap Ops Team

            https://OpenStreetMap.org has been disrupted today. We're working to keep the site online while facing extreme load from anonymous scrapers spread across 100,000+ IP addresses. Please be patient while we mitigate and protect the service. #OpenStreetMap #DDoS #Scrapers #AI

            WTLW This user is from outside of this forum
            WTLW This user is from outside of this forum
            WTL
            wrote last edited by
            #24

            @osm_tech @lehtimaeki I wonder if anyone has looked at the various AI browsers out there to see if they're used for active scraping.

            1 Reply Last reply
            0
            • OpenStreetMap Ops TeamO OpenStreetMap Ops Team

              @Don_Clemente Just dumb scrapers. They want our data, so maybe we /should/ redirect them to the latest full history planet file, it is only 245GB πŸ˜‰

              IvΓ‘n SΓ‘nchez OrtegaI This user is from outside of this forum
              IvΓ‘n SΓ‘nchez OrtegaI This user is from outside of this forum
              IvΓ‘n SΓ‘nchez Ortega
              wrote last edited by
              #25

              @osm_tech @Don_Clemente Replacing the actual data with Null Island data that points to planet.osm.org is not a dumb idea IMHO

              1 Reply Last reply
              0
              • OpenStreetMap Ops TeamO OpenStreetMap Ops Team

                https://OpenStreetMap.org has been disrupted today. We're working to keep the site online while facing extreme load from anonymous scrapers spread across 100,000+ IP addresses. Please be patient while we mitigate and protect the service. #OpenStreetMap #DDoS #Scrapers #AI

                Martin RustM This user is from outside of this forum
                Martin RustM This user is from outside of this forum
                Martin Rust
                wrote last edited by
                #26

                @osm_tech Block all read access except from registered users? And be restrictive on whom to grant an account (captcha, verified email address)?

                Open RiskO 1 Reply Last reply
                0
                • OpenStreetMap Ops TeamO OpenStreetMap Ops Team

                  https://OpenStreetMap.org has been disrupted today. We're working to keep the site online while facing extreme load from anonymous scrapers spread across 100,000+ IP addresses. Please be patient while we mitigate and protect the service. #OpenStreetMap #DDoS #Scrapers #AI

                  Bill WoodcockW This user is from outside of this forum
                  Bill WoodcockW This user is from outside of this forum
                  Bill Woodcock
                  wrote last edited by
                  #27
                  @osm_tech

                  Please, please, AI bubble, burst already, so we can stop seeing this kind of vandalism.
                  1 Reply Last reply
                  0
                  • nag4wikaN nag4wika

                    @osm_tech cc telecomix

                    The DoctorD This user is from outside of this forum
                    The DoctorD This user is from outside of this forum
                    The Doctor
                    wrote last edited by
                    #28

                    @osm_tech @nag4wika Yuh?

                    ~~~==:}}

                    1 Reply Last reply
                    0
                    • Don Clemente πŸ”œ39C3 ☎️2470D Don Clemente πŸ”œ39C3 ☎️2470

                      @osm_tech yes, but that's heavy on your resources and bandwidth. A zip file of 10GB Zeros is much more resource-efficient on your side πŸ˜‰

                      The DoctorD This user is from outside of this forum
                      The DoctorD This user is from outside of this forum
                      The Doctor
                      wrote last edited by
                      #29

                      @osm_tech @Don_Clemente Can confirm. >:D

                      1 Reply Last reply
                      0
                      • Martin RustM Martin Rust

                        @osm_tech Block all read access except from registered users? And be restrictive on whom to grant an account (captcha, verified email address)?

                        Open RiskO This user is from outside of this forum
                        Open RiskO This user is from outside of this forum
                        Open Risk
                        wrote last edited by
                        #30

                        @martinrust

                        had a similar issue with a much, much smaller wiki server (πŸ˜… ) but facing the same anonymous scrapers coming from thousands of random IP's.

                        Couldn't find a way to keep them from crashing the server, so now access is by registration only.

                        Very disruptive, this "AI" race to the bottom destroyed the open internet 😟

                        @osm_tech

                        1 Reply Last reply
                        0
                        • OpenStreetMap Ops TeamO OpenStreetMap Ops Team

                          https://OpenStreetMap.org has been disrupted today. We're working to keep the site online while facing extreme load from anonymous scrapers spread across 100,000+ IP addresses. Please be patient while we mitigate and protect the service. #OpenStreetMap #DDoS #Scrapers #AI

                          Air Quotes ComedianA This user is from outside of this forum
                          Air Quotes ComedianA This user is from outside of this forum
                          Air Quotes Comedian
                          wrote last edited by
                          #31

                          @osm_tech I went to use OpenStreetMap today to get directions but it asked me to sign up/ log in, so I used Mapquest instead which doesn't.

                          OpenStreetMap Ops TeamO 1 Reply Last reply
                          0
                          • OpenStreetMap Ops TeamO OpenStreetMap Ops Team

                            @ISibboI Correct. Not an ounce of brain. It has been going on for months, but it is just getting worse. Waste of their resources/time and ours.

                            Jonas :debian: :norway_fb:J This user is from outside of this forum
                            Jonas :debian: :norway_fb:J This user is from outside of this forum
                            Jonas :debian: :norway_fb:
                            wrote last edited by
                            #32

                            @osm_tech @ISibboI Scraping the OpenStreetMap website gives of the same vibe as stealing free stuff.

                            Link Preview Image
                            1 Reply Last reply
                            0
                            • Air Quotes ComedianA Air Quotes Comedian

                              @osm_tech I went to use OpenStreetMap today to get directions but it asked me to sign up/ log in, so I used Mapquest instead which doesn't.

                              OpenStreetMap Ops TeamO This user is from outside of this forum
                              OpenStreetMap Ops TeamO This user is from outside of this forum
                              OpenStreetMap Ops Team
                              wrote last edited by
                              #33

                              @Air_Quotes_Comedian Weird. We don't require login for directions.

                              Link Preview Image
                              Air Quotes ComedianA 1 Reply Last reply
                              0
                              • OpenStreetMap Ops TeamO OpenStreetMap Ops Team

                                @Air_Quotes_Comedian Weird. We don't require login for directions.

                                Link Preview Image
                                Air Quotes ComedianA This user is from outside of this forum
                                Air Quotes ComedianA This user is from outside of this forum
                                Air Quotes Comedian
                                wrote last edited by
                                #34

                                @osm_tech Hmmm.

                                Perhaps I'm the only one who is required to log in for directions.

                                MapAmπŸ’œreM 1 Reply Last reply
                                0
                                • OpenStreetMap Ops TeamO OpenStreetMap Ops Team

                                  https://OpenStreetMap.org has been disrupted today. We're working to keep the site online while facing extreme load from anonymous scrapers spread across 100,000+ IP addresses. Please be patient while we mitigate and protect the service. #OpenStreetMap #DDoS #Scrapers #AI

                                  ForestF This user is from outside of this forum
                                  ForestF This user is from outside of this forum
                                  Forest
                                  wrote last edited by
                                  #35

                                  @osm_tech thank you for your service πŸͺ–

                                  1 Reply Last reply
                                  0
                                  • OpenStreetMap Ops TeamO OpenStreetMap Ops Team

                                    @ISibboI Correct. Not an ounce of brain. It has been going on for months, but it is just getting worse. Waste of their resources/time and ours.

                                    Not meN This user is from outside of this forum
                                    Not meN This user is from outside of this forum
                                    Not me
                                    wrote last edited by
                                    #36

                                    @osm_tech @ISibboI isn't there a chance that service disruption is one of the objectives?

                                    OpenStreetMap Ops TeamO 1 Reply Last reply
                                    0
                                    • Not meN Not me

                                      @osm_tech @ISibboI isn't there a chance that service disruption is one of the objectives?

                                      OpenStreetMap Ops TeamO This user is from outside of this forum
                                      OpenStreetMap Ops TeamO This user is from outside of this forum
                                      OpenStreetMap Ops Team
                                      wrote last edited by
                                      #37

                                      @Notme @ISibboI We are not unique in battling these scrapers; Wikipedia, KDE, Gnome, OpenWRT, Arch Linux and many other projects have the some issue.

                                      1 Reply Last reply
                                      0
                                      • OpenStreetMap Ops TeamO OpenStreetMap Ops Team

                                        https://OpenStreetMap.org has been disrupted today. We're working to keep the site online while facing extreme load from anonymous scrapers spread across 100,000+ IP addresses. Please be patient while we mitigate and protect the service. #OpenStreetMap #DDoS #Scrapers #AI

                                        AssimilateborgA This user is from outside of this forum
                                        AssimilateborgA This user is from outside of this forum
                                        Assimilateborg
                                        wrote last edited by
                                        #38

                                        @osm_tech I blocked user agents matching "Chrome.139.0.0.0 Safari.537.36" as this combination seems to not exist, except for millions of requests out of china.

                                        1 Reply Last reply
                                        0
                                        • OpenStreetMap Ops TeamO OpenStreetMap Ops Team

                                          @geospacedman @ISibboI They are scraping the website pages: /ways/, /nodes/ and /relations/. All this data is published on planet.osm.org already.

                                          ideogramI This user is from outside of this forum
                                          ideogramI This user is from outside of this forum
                                          ideogram
                                          wrote last edited by
                                          #39

                                          @osm_tech
                                          Bloody idiots. I wonder if it's AI platforms?
                                          @geospacedman @ISibboI

                                          1 Reply Last reply
                                          0

                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Login or register to search.
                                          Powered by NodeBB Contributors
                                          • First post
                                            Last post