
TinyChan

Topic: Random thing

+Anonymous A 7 months ago #67,266

Idk if anybody else knows this, but Russia frequently scrapes the content on both MiniChan and TinyChan.

+Anonymous B 7 months ago, 1 hour later [T] [B] #668,708

Proof? Or is this yet another one of your politisperg threads?

+Anonymous C 7 months ago, 22 minutes later, 1 hour after the original post [T] [B] #668,710

@previous (B)
YandexBot

·Anonymous C 7 months ago, 54 seconds later, 1 hour after the original post [T] [B] #668,711

You know what? You can prove it for yourself. Make a grabify link, post it on here, come back tomorrow, and check the log and you’ll see YandexBot in there.

·Anonymous B 7 months ago, 14 minutes later, 1 hour after the original post [T] [B] #668,712

@668,710 (C)
@previous (C)
If Yandex is easily identifiable, how does Google index web pages?
I found this: https://developers.google.com/search/apis/ipranges/googlebot.json

(Edited 2 minutes later.)
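
For anyone wanting to use that published list: a rough sketch of checking whether an IP address falls inside the ranges. It assumes the file is a JSON object with a "prefixes" array whose entries carry either an "ipv4Prefix" or an "ipv6Prefix" CIDR string. The sample data below is a placeholder built from reserved documentation ranges, not real Googlebot addresses, and `load_networks` and `ip_in_ranges` are made-up helper names.

```python
import ipaddress
import json

# Placeholder data in the assumed shape of googlebot.json. The prefixes are
# reserved documentation ranges, NOT real Googlebot addresses.
SAMPLE_JSON = """
{
  "prefixes": [
    {"ipv4Prefix": "192.0.2.0/24"},
    {"ipv6Prefix": "2001:db8::/32"}
  ]
}
"""


def load_networks(text):
    """Parse the JSON text into a list of ip_network objects."""
    networks = []
    for entry in json.loads(text).get("prefixes", []):
        cidr = entry.get("ipv4Prefix") or entry.get("ipv6Prefix")
        if cidr:
            networks.append(ipaddress.ip_network(cidr))
    return networks


def ip_in_ranges(ip, networks):
    """Return True if the address lies inside any of the given networks."""
    addr = ipaddress.ip_address(ip)
    return any(addr.version == net.version and addr in net for net in networks)


networks = load_networks(SAMPLE_JSON)
print(ip_in_ranges("192.0.2.10", networks))
print(ip_in_ranges("198.51.100.1", networks))
```

To check real traffic, swap the placeholder text for the contents of the downloaded file.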


·Anonymous C 7 months ago, 8 minutes later, 2 hours after the original post [T] [B] #668,713

@previous (B)
The user agent is "YandexBot"
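
A minimal sketch of the check described above: scan access-log lines and keep the ones whose user agent contains "YandexBot". It assumes a combined-style log where the user agent is the last quoted field on the line. The sample lines, IPs, and helper names are made up for illustration. Keep in mind the user agent is self-reported by the client, so a match only shows what the visitor claims to be.

```python
import re

# Assumes the user agent is the last double-quoted field on the line.
UA_PATTERN = re.compile(r'"([^"]*)"\s*$')


def user_agent(line):
    """Return the trailing quoted field of a log line, or None if absent."""
    match = UA_PATTERN.search(line)
    return match.group(1) if match else None


def is_yandexbot(line):
    """True if the line's user agent mentions YandexBot (case-insensitive)."""
    ua = user_agent(line)
    return ua is not None and "yandexbot" in ua.lower()


# Hypothetical log lines; the IPs are reserved documentation addresses.
SAMPLE_LOG = [
    '192.0.2.10 - - [01/Jan/2024:00:00:00 +0000] "GET /topic/1 HTTP/1.1" '
    '200 512 "-" "Mozilla/5.0 (compatible; YandexBot/3.0; +http://yandex.com/bots)"',
    '198.51.100.7 - - [01/Jan/2024:00:00:05 +0000] "GET /topic/1 HTTP/1.1" '
    '200 512 "-" "Mozilla/5.0 (X11; Linux x86_64)"',
]

hits = [line for line in SAMPLE_LOG if is_yandexbot(line)]
print(len(hits))
```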

+ᏧᏟ ᎩᎦᎨ 7 months ago, 5 hours later, 7 hours after the original post [T] [B] #668,768

@previous (C)
Good catch, but they're not just scraping; they're harvesting the responses to their contributions for analysis. When the number of users spiked and the threads and their data passed by in seconds instead of minutes, I wondered who was doing it.
True if big. This one's for Ivan... https://www.youtube.com/watch?v=QpxA_ZxGX_M