[ home ] [ suggest ] [ soy / qa / g / pol / raid / incel / int / a / muv / qst / tv / r9k / r / giga / x ] [ b / sneed / fap / webm ] [ overboard ] [ rules ] [ blog ] [ kuz ] [ wiki ] [ dailyjak ] [ booru ] [ archive ]

/suggest/ - suggest


Email
Comment
File
Password (For file deletion.)

Janitor applications are now being accepted. Apply here

File: 1663568366070.png (1.01 MB, 800x600, ClipboardImage.png) ImgOps

 No.39736[View All]

Hello, the archives have now released. This thread is for bug reports, suggestions, etc.

Information:
1. Its in beta, nothing is permanent on it for now (except the threads)
2. Things are still being worked on, expect bugs. All of this was written within the last 24 hours.
3. Only /soy/ is going to be archived FOR NOW, once we're satisfied with an initial release, we will expand it to all the boards.
4. The archive updates hourly, threads wont appear until an hour after their made. This will help us prevent CP from being archived.

>why are the images in their own row?

its for multi image support. ill make it look better later

>what order are they sorted in?

from creation date, newest to oldest

>will you add x feature?

Probably yes, let me emphasize its in beta now, its not done yet, not even close.
107 posts and 19 image replies omitted. Click reply to view.

 No.40154

>>40140
because it's full of dust

 No.40159

>>40154
/soy/ is dusty too
ARCHIVE >>>/nate/ NOW!

 No.40162

>>40154
Gemmier than /soy/

 No.40186

If the archive grabs everything posted as it's posted then what's the point of having the delete post feature on?

 No.40193

File: 1663802484454.jpg (133.68 KB, 1280x720, consider the following.jpg) ImgOps

I'm pretty sure it isn't hard to scrape all the threads on this backup, right?
http://103.219.154.246/

 No.40194

>>40186
the only reason its on is because people forced kuz

 No.40195

>>40193
Thank you very much for this suggestion. We are currently scraping the in-tact API of that site, and its restored hunrdeds of threads from that era. This has greatly expanded our archives use, so thank you.

https://sp.logwarehouse.net/read.cgi/suggest/6784
https://sp.logwarehouse.net/read.cgi/suggest/8908

some old threads we've been able to archive because of this.

 No.40197

THE ORIGINAL SOYCLIPSE THREAD IS NOW ARCHIVED
https://sp.logwarehouse.net/read.cgi/raid/12645

 No.40200


 No.40202

File: 1663812083502.png (52.46 KB, 775x849, not amused.png) ImgOps

>>40195
archive >>>/nate/ and scrape it from that IP

 No.40203

>>40202
>Archive the coal board even doe all it would save it a bunch of B​B​Coal

 No.40204

File: 1663812584076.png (32.26 KB, 721x720, happy gemson.png) ImgOps

>>40203
Are you admitting that Rika is built for B​B​​C by default?

 No.40207

^ tranny moment

 No.40209

>>40073
i asked

 No.40210

>>40195
Good job.
Here's another backup from another time
http://185.77.225.223/
Also is there any chance threads from the wayback machine can be added?

 No.40211

>>40210
>Good job.
>Here's another backup from another time
>http://185.77.225.223/
Thanks, I'll add it too. These are greatly appreciate, as each one add upwards of 800 old thread to our archive.

>Also is there any chance threads from the wayback machine can be added?

No, the only reason these worked is because the api is completely in tact and independent from the DB. However, cloudflare messes with archived files, and archived API's probably dont even exist, so it would require significant extensions. If some break through appears that does allow this, we will post that announcement here

 No.40219

>>40036
FoolFooka is shit according to the desuarchive devs

 No.40221

>>40219
yeah its straight up doodoo. this kuz shit is probably better for our needs

 No.40222

foolfuuka is over engineered, kuz software is (generally) extremely simple and effective

 No.40242

>>40219
FoolFuuka was being replaced by wakarimasen devs, but unfortunately wakarimasen died. But it's shitty software, writing from scratch is a better option.
NOW ARCHIVE /nate/

 No.40367

Seeing that this is a first party archive and there is access to the backend, does Archiva also grab private data that Vichan stores, such as IP addresses?

 No.40368

>>40367
he will never reveal that information

 No.40441

File: 1663862129834.png (6.12 KB, 105x319, ClipboardImage.png) ImgOps

>>40367
>>40368
The archive uses our API to scrape posts, so it is in practice, a third party archive. No other data is stored. Pic related is all the data categories it stores.

 No.40444

File: 1663862376197.png (12.81 KB, 233x255, 1663812083502.png) ImgOps

>>40202
Tsmt, also /nate/ doesn't have enough threads, it has only 200 threads, unlike /soy/, /a/, and the rest of the boards, which have 400, fix this.

 No.40450

File: 1663862554031.gif (76.79 KB, 167x255, cobnam_style.gif) ImgOps

posting a 'son in this 'emmy bread

 No.40537

>>40536
I wish asking for /nate/ to be archive gave me 50 cents

 No.40882

>>39940
He should probably just partially open-source it, mainly when it comes to the back-end (I.e. the actual scraper, but modified to not specifically target the sharty at first), and a very simplistic version of the current front-end. That way, it’d just be a generic vichan archiver, and not really a sharty one
Would be great for archiving other altchans

 No.40980

you faggot cocksucker nigger monkey search doesn't work and so many threads I was looking for don't even exist
I went to each page searching for a thread that had "saving" in it, and I found nothing
fix it you log eater

 No.40981

>>40882
he said it uses the vichan api to scrape for threads. you can write one in maybe 50 lines of python

 No.40982

>>40441
koozy plz add an api to the archive as well plzzz

 No.41053

up

 No.41054

>>40982
what purpose would this serve

 No.41188

There's a problem with spoilered files where sometimes they don't get archived, there's also a problem with the >>>/nate/ board.

 No.41207

Will the archive crawl Yandex's caches?

Yandex has a HUGE backlog of threads (on the .ru domain) from the soot era that are not saved anywhere else.

 No.41322

>>41054
i want to access the archive without any images
>>39736
come on janny, plz do it

 No.41351


 No.41352

>>41351
not him but you should consider archiving >>>/nate/ fr

 No.41354

>>41352
it's filled with dust that no one wants.

 No.41355

>>41351
>saving bandwidth is... LE BAD!

 No.41356

>>41355
but why albeit?

 No.41357

>>41356
ok im finna be straight wif you fam, i want to scrape the whole thing for le heckin datahoarding

 No.41358

>>41354
everyone wants to see it archived though

 No.41359

>>41358
no wants want that, except you.

 No.41360

>>41359
i do you retarded tranny

 No.41361

File: 1664119388813.png (52.46 KB, 775x849, not amused.png) ImgOps

>>41359
I want that

 No.41362


 No.41364

>>41360
says the who wants a board made specifically made de get rid of tranny nas garbage

 No.41450

add a json api

 No.41484

make it load faster

 No.41616

>>39942
Retarded. If open-sourcing wasn't competitive every big tech company wouldn't do it



[Return][Go to top] [Catalog] [Post a Reply]
Delete Post [ ]
[ home ] [ suggest ] [ soy / qa / g / pol / raid / incel / int / a / muv / qst / tv / r9k / r / giga / x ] [ b / sneed / fap / webm ] [ overboard ] [ rules ] [ blog ] [ kuz ] [ wiki ] [ dailyjak ] [ booru ] [ archive ]