A Fuller Bubbles RSS Feed

There’s a new, fun aggregator out there: bubbles.town. I think the pitch is “like Hacker News, but for independent blogs instead of Silly Valley hucksters”. So it’s got upvoting and comments and all of that stuff.

Like Hacker News, it also has feeds. The feeds are nicer than on Hacker News — they have snippets of texts from the pages they link to:

But it’s only a snippet, so I wanted to make a fuller feed, like I’ve been making for Hacker News.

So here it is: Fuller Bubbles.

The source code is on Microsoft Github. I’ve just tweaked the original script a tiny bit more, and it works by going through the feed and then grabbing text from the original pages and putting that in there.

The point of all this is to be able to read the articles themselves in your favourite feed reader, i.e., in Gnus, reading from the Gwene NNTP server. (Isn’t that your favourite feed reader?! WELL I NEVER.)

See? All purty.

Of course, it doesn’t work reliably — it’s getting more difficult all the time do to web scraping, and especially from data centres (and that’s where this is running). But it works well enough for it to be worth it, I think.

Over 1,000,000 Pages About Comics

One over, that is.

It’s another extremely important and major milestone for Mrs. Kwakk Wakk’s Comics Research Site: The number of digits in the page count has increased! One more digit! Whoho! Look at how puny the number was just a day ago:

Yeah, yeah.

What finally pushed the count over the limit was that I’ve continued to top up the “text pages from comics” category. (It has non-comics pages from comic books — i.e., letters pages, editorials, “hype pages” and the like.)

I think we’re now up to 50K issues? But the vast, vast majority of the comics are super-hero comics, so there’s not that many letters pages featuring “T. M. Maple”, for instance. So, it’s a very skewed collection.

At least so far — I guess I’ll continue to run the scripts, and perhaps some indie comics collections will show up eventually. I wouldn’t hold my breath, though.

(To put this into context — the “queue” is 3K items long, and we’re up to… item 283 now. *gulp* That’s 3TBs worth of comics so far, so if the queue is consistent, we’ll end up at 30TBs of data to sift for text pages… Possibly. But I guess many seeds as we get further out into the queue will be dead, so possibly not.)

Random Comics

Here’s some comics I’ve read over the past month.

This is the one that I’ve spent the most time with, by far. It’s a collection of DC westerns from 1970-73. It’s not something I would normally have bought, but Matt Seneca made an argument for it collecting a masterpiece — Son of Tomahawk by Robert Kanigher and Frank Thorne, so I thought I’d give it a go. I’ve been reading a couple stories here and there, so it’s taken me weeks to get through these six hundred pages.

And… is Son of Tomohawk a masterpiece? No, of course not. But it’s pretty darn good! Much better than what you’d expect for with something like this. The stories feel fresh, and Thorne obviously put a lot of work into this — it’s a propulsive read. Kanigher sort of ran out of stories to do towards the end, but it’s very enjoyable.

Have you noticed that DC artists (in the 70s) basically were in one of two camps? You had Neal Adams and all his imitators, and then you had the Joe Kubert school (heh heh) of art, and this book is 90% the latter. So Kubert himself, Thorne, Grey Morrow, Gil Kane in his grittier mode… It suits these stories very well, if course.

So these stories look swell.

It’s also fun to randomly stumble over a story pencilled by Adams and inked by Berni Wrightson.

I was also impressed at how varied these stories are. I had the misfortune to read a whole bunch of Marvel westerns from a few years earlier, and they were formulaic to a fault, and were nigh unreadable after reading a couple, just by how boring it got. These pieces, on the other hand, were very entertaining… at least when reading a handful of these per sitting.

This on the other hand…

What is it about the translated books Fantagraphics are publishing these days? So many of them seem to come out of the same machine — not very attractively drawn, and with desaturated art, and aggressively middlebrow. It’s like Respectability 101 time. (And how did they get the paper this shiny? It’s almost impossible to snap anything here because of the reflections…)

Anyway, I didn’t manage to make it through this one. Very annoying book.

I did make it through this one, also from Fantagraphics.

It has the unfortunate reek of metaphor.

They’re fighting their way through a jungle! Endless! So symbolic!

I mostly felt bad for the artist who had to draw hundreds and hundreds of pages after the writer had turned in a half-assed script that feels like it took all of thirty minutes to write. That’s how much depth it has.

But perhaps it took longer to write, in between playing video games.

And I read a li’l stack of super-hero books. Anything of note?

Not really, but I enjoyed Web of Venom #1.

It’s very amusing.

Written by Jordan Morris… not a name that’s familiar to me, but I guess I should start keeping a lookout for more books from Morris.

Since I said what the best of them was, I guess I should say what the worst one was. So many to choose from, though, but I’ll go with The Uncanny X-Men Annual #1.

It’s just cringe worthy.

I assumed that this was a new album, but it’s from 2007, but newly translated. The Natacha series is very variable, but it’s sometimes great fun.

And Wathery is a solid artist… but there’s probably a reason this hasn’t been translated before.

It’s just very chaotic. Walthery starts the intensity at MAX and then holds it there for the entire album. It’s defatigant.

Speaking of French… I’ve read much of this series by Leo before, but I happened upon the first one here in a used bookstore, and I found it so relaxing to read in French that I ordered the rest from bdfugue.

The thing with Leo is that he has no — zero — sense of humour. There are no jokes, no wordplay and no slang. This makes it totally ideal for a beginner in French to read. These albums are some of the few where there usually isn’t a single sentence I don’t understand (although there are some words I have to guess at from the context, so I’m still learning while reading).

The other thing is that these books are so… science fictioney. They have all these strange and unnerving concepts that Leo keeps embroidering on and revealing more and more about. It’s no wonder these books are hugely popular.

And the artwork is extremely appealing. All these strange creatures, the vibrant colours, and the scale of the environs…

His only problem artwise is that he basically has one human form that he likes to draw for 90% of his characters: Slim and tall, and they have these narrow faces. Still, Leo manages to make them all distinguishable by varying hairdos, hair colour and dress.

Another thing — “ce n’est pas croyable, ça!” — Leo’s go-to villain is (usually) The Patriarchy: Our heroes often find themselves in societies/villages that are ruled by men who have decided that they control everything, and the way they seize control is always nightmarish, but oh so familiar. It’s great! (Don’t worry, they’re always defeated.)

I’ve read (most of) these books before in the Cinebook versions (translated to English). Cinebook is notorious for drawing shirts on all the topless characters (because they also sell these comics in Saudi Arabia, or something, and there’s No Boobs Allowed), but I think a much bigger problem is the colours. Cinebook prints on a highly absorbent paper, so the saturated colours Leo used are washed out and not as impressive. (And also 10% smaller page size, for some reason…)

Anyway, I’ll be reading all of Leo’s books in this series, I think. There’s about 20 of them? They read like TV series episodes…

And I read three issues of Spirou.

The kids that get lost all the time discover the secret to how Spirou is made — the publisher has all the artists chained up in the basement! Can the kids save them, are do they just get lost again!?

And this édito from Les Fabrices made me laugh.

OK, and that’s it? Yup.

Gadget Review: Agar Mini Keyboard

I got a package today! I thought it perhaps was that keyboard I ordered some time ago, and my confusion grew when I picked it up, because the box is, like, a couple kilos. And I only ordered a li’l keyboard!

Let’s unbox!

Yeah OK…

Still mysterious.

And the box contains… something that makes a noise? Is that a good sign?

Now then…

Yes! It’s a keyboard!

How cute.

And there’s… stuff…

And more stuff.

Oh, the other box had lots of alternate keycaps. So I guess the stuff is for prying off the caps if I want to.

So much stuff.

And, yes, the keyboard is small, but indeed — it’s heavy. It’s got a metal insert (that can be removed) to give it more heft.

It’s a portable size, so I guess the extra weight is to give you a better work-out while you’re out climbing mountains with the keyboard in your bag.

And what’s the range? This is a bluetooth keyboard, and I’ve had some with just horrible range. It’s important that this has a good range, because in my use case here, it’s going to be placed about three meters from the Raspberry Pi that it’s going to connect to.

Looking promising in my very scientific test, as you can see above, unless you get too sea sick.

So what is my use case, you ask? I mean… you do, doncha?

BE THAT WAY, THEN

As you probably won’t remember, I was looking for a way to hyper-optimise one thing: Grocery shopping. As you can read in that blog post, I ended up using the li’l laptop you see above, but that was only because this keyboard took its time to arrive. (Not through any fault of the manufacturer — it’s just that I bought the keyboard at a pre-sale; it wasn’t in production yet.)

I couldn’t find any other wireless keyboard with these specs — just the normal alphabet keys, space and return and no much else. And — it had to be a “real” keyboard, not something you’d thumb or anything.

And.. I think there’s too much going on visually — all the “alt” meanings (which I have no use for) are marked on the keyboard, so perhaps I can buy new key caps that are plainer?

But otherwise I think it looks pretty good. The range is good, and it allegedly has enough battery to last three months on a single charge. But on the other hand, I do have access to electricity where I’ve put it, so perhaps I can just leave it plugged in.

I tried it a couple times. There’s no screen here, of course, but you don’t need to see what you type to type bananas RET, which is what I type the most. It feels very nice. Because of the weight, it feels very solid, and doesn’t scootch around, and the key switches are clicky in a very satisfactory way.

So… I recommend it, I guess? Here’s the link again if you’re looking for something like this.

But I’m not absolutely sure whether I want to switch out my current solution. I kinda like the look of the li’l laptop? It’s so… technical.

(Oh, and I’ve now been using this system for a month, and I have to say that it works even better than I’d hoped. Whenever I’m cooking and noticing that I’m missing something or is low on something, I can just tap away at that keyboard without really thinking about it and then get on with what I was doing. That is, it doesn’t involve a distracting context shift. The poor delivery guy, though — this has led to bigger deliveries (at least the first few times) because I’m forgetting less of what I was going to buy. So the previous delivery was probably the biggest one I’ve ever had done. And it’s on the fourth floor without an elevator, so poor guy…)

Anyway! Nice keyboard.

Searchable comics text pages?

After finding a collection of Marvel Bullpen Bulletins for the search engine for magazines about comics, I started to wonder whether it’d be useful (or fun) to include text pages from comics in general. I mean — editorials, letters pages, “hype pages”… There’s information there that’s not available anywhere else.

So… perhaps? Maybe? Peut-être?

The first question is, of course: Can I lay my hands on a huge collection of scanned comics in the first place? And the answer is: Of course, the pirates are still pirating out there.

So I hacked an old torrent client to be more handy for this project. Like, if the torrent seems to be dead (either not responding within a few minutes, or stalling for a long time), then just abandon it. I’m after quantity, not quality, after all.

And after downloading, just hang up, like an animal. (Yes, I know, it’s very anti social towards the pirates, but I’m downloading things not to read, so er ok.)

After a few weeks, I had 1.7TB worth of comics CBR/CBZ files (about 28K files, which vaguely corresponds to 28K issues of random comics).

Now what.

Well, I unpacked them, and got 27K directories after removing duplicates. Then I had Claude write me a script to identify the text pages, and I deleted all the other pages. Then it wrote more scripts to deduplicate repeated pages (like company wide editorials and the like). And then I ran the resulting directories through my OCR/indexing setup, and viola:

Now you can find out whether Hulk would beat Superman in a fight, for instance.

While this hasn’t been a lot of work (perhaps a couple days?), each step in this process has taken a lot of time because there’s a lot of data to process. And the comics that were downloaded leaned hard on recent comics, which isn’t really what’s interesting for research porpoises.

So I dunno… is this useful?

I haven’t included this data set in the “main” categories — you have to go to https://kwakk.info/pages/ explicitly to search these pages.

If I had a better way to search for torrents of older comics, that would make this more interesting, I think, but I haven’t really found anything like that.

Heh, while futzing around here, I came across this cover. Surely this has to be the most 90s comics cover ever? Behold the anatomy! Wince! Turn away!

Anyway, anyway… It’s a really random selection of comics, and there’s no quality control whatsoever, of course, but it seems like it might be vaguely useful. There’s no way to identify what issue each page is from, except by looking at the cover, so I’ve included covers, too.

Perhaps I’ll download some further terrorbytes of comics? Perhaps not? Time will tell, I guess.