I wondered about the robots.txt

I can see the case for it, I could also see the case for allowing at least Google to index the site.

Has there been some discussion about this previously?

    • Sam_uk@slrpnk.netOP
      link
      fedilink
      English
      arrow-up
      2
      ·
      6 months ago

      I think it would just be

      User-agent: *
      Disallow: /
      User-agent: Googlebot
      Allow: /
      
      • poVoq@slrpnk.netM
        link
        fedilink
        English
        arrow-up
        2
        ·
        edit-2
        6 months ago

        Ok I tried to allow-list some search engine spiders in the robot.txt, however they will probably still just run into the AI scraper block if they act too shady.

        But honestly, I highly doubt we will get much traffic from Google search. It’s completely gone to shit these days.