TinyChan

Topic: Random thing

+Anonymous A 10 months ago #67,266

Idk if anybody else knows this but Russia frequently scrapes the content on both MiniChan and TinyChan.

+Anonymous B 10 months ago, 1 hour later #668,708

Proof? Or is this yet another one of your politisperg threads?

+Anonymous C 10 months ago, 22 minutes later, 1 hour after the original post #668,710

@previous (B)
YandexBot

·Anonymous C 10 months ago, 54 seconds later, 1 hour after the original post #668,711

You know what? You can prove it for yourself. Make a grabify link, post it on here, come back tomorrow, and check the log and you’ll see YandexBot in there.

·Anonymous B 10 months ago, 14 minutes later, 1 hour after the original post #668,712

@668,710 (C)
@previous (C)
If Yandex is easily identifiable, how does Google index web pages?
I found this: https://developers.google.com/search/apis/ipranges/googlebot.json
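
For anyone curious how that file could be used: a rough sketch, assuming the JSON has a "prefixes" list whose entries carry an "ipv4Prefix" or "ipv6Prefix" key. The sample data and the helper names (load_networks, ip_in_ranges) are made up for illustration, and the addresses are documentation ranges, not real Googlebot ranges.

```python
import ipaddress

# Hypothetical sample in the assumed shape of googlebot.json.
# These are reserved documentation ranges, NOT real Googlebot prefixes.
SAMPLE = {
    "prefixes": [
        {"ipv4Prefix": "192.0.2.0/24"},
        {"ipv6Prefix": "2001:db8::/32"},
    ]
}


def load_networks(data):
    """Turn the 'prefixes' entries into ipaddress network objects."""
    nets = []
    for entry in data.get("prefixes", []):
        cidr = entry.get("ipv4Prefix") or entry.get("ipv6Prefix")
        if cidr:
            nets.append(ipaddress.ip_network(cidr))
    return nets


def ip_in_ranges(ip, nets):
    """Return True if the address falls inside any of the networks."""
    addr = ipaddress.ip_address(ip)
    return any(addr in net for net in nets)


nets = load_networks(SAMPLE)
```

To check a real visitor you would load the actual file from the URL above instead of SAMPLE and pass the IP from your access log to ip_in_ranges.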

(Edited 2 minutes later.)


·Anonymous C 10 months ago, 8 minutes later, 2 hours after the original post #668,713

@previous (B)
The user agent is "YandexBot"
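
If you want to filter a log for it, something like this would do. It is only a sketch: the function name and the sample strings are made up for illustration, and a User-Agent header can be spoofed by anyone, so a match is a hint rather than proof.

```python
def is_yandex_bot(user_agent: str) -> bool:
    """Return True if the User-Agent string mentions YandexBot (case-insensitive)."""
    return "yandexbot" in user_agent.lower()


# Illustrative User-Agent strings, not taken from a real log.
sample_agents = [
    "Mozilla/5.0 (compatible; YandexBot/3.0; +http://yandex.com/bots)",
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:126.0) Gecko/20100101 Firefox/126.0",
]

# Keep only the entries that look like YandexBot.
hits = [ua for ua in sample_agents if is_yandex_bot(ua)]
```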

+ᏧᏟ ᎩᎦᎨ 10 months ago, 5 hours later, 7 hours after the original post #668,768

@previous (C)
Good catch, but they're not just scraping; they're harvesting the responses to their own contributions for analysis. When the number of users spiked and threads and their data went by in seconds instead of minutes, I wondered who was doing it.
True if big. This one's for Ivan... https://www.youtube.com/watch?v=QpxA_ZxGX_M