@Treczoks

Treczoks@lemmy.world · 5 days ago

Good if you are rated by an AI that pays for LOCs.

Treczoks@lemmy.world · 5 days ago

Happened to me, too.

Treczoks@lemmy.world · 9 days ago

“Accidentally”

Treczoks@lemmy.world · 15 days ago

This depends on what you are actually looking for, and how you are looking for it.

Do you really need pattern matching, or are you only looking for fixed strings? In that case, other tools may be faster.

If you need a case-insensitive search on a mixed-case data set, make a copy that is all upper or all lower case, and search there.

If you only search in certain columns, make a copy that only includes these.
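A minimal sketch of the two "make a copy" ideas above, assuming the data is CSV-like. The sample data, the column names and the helper names `build_search_copy` and `find_rows` are made up for illustration. The copy is built once; each search is then a plain substring test with no regex and no per-query case folding of the data.

```python
import csv
import io

def build_search_copy(rows, columns):
    """Return one lowercased string per row, containing only the chosen columns."""
    return ["\t".join(row[c] for c in columns).lower() for row in rows]

def find_rows(search_copy, needle):
    """Fixed-string search on the prepared copy; returns matching row indices."""
    needle = needle.lower()
    return [i for i, line in enumerate(search_copy) if needle in line]

# Hypothetical sample data with mixed case.
SAMPLE = "id,name,city\n1,Alice Smith,Berlin\n2,BOB JONES,berlin\n3,Carol,Paris\n"
rows = list(csv.DictReader(io.StringIO(SAMPLE)))

# Prepared once: lowercased, and restricted to the columns we actually search.
search_copy = build_search_copy(rows, ["name", "city"])
hits = find_rows(search_copy, "BERLIN")
```

The indices in `hits` point back into the original `rows`, so the untouched data is still available for output.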

Or import the data into a database.
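As a rough illustration of the database route, here is a sketch using SQLite through Python's standard `sqlite3` module. The table name `records`, its columns and the sample rows are assumptions. `COLLATE NOCASE` makes the comparison case-insensitive for ASCII letters only.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # in-memory database, just for the sketch
conn.execute("CREATE TABLE records (id INTEGER PRIMARY KEY, name TEXT, city TEXT)")
conn.executemany(
    "INSERT INTO records (id, name, city) VALUES (?, ?, ?)",
    [(1, "Alice Smith", "Berlin"), (2, "BOB JONES", "berlin"), (3, "Carol", "Paris")],
)
# Index on the searched column, using the same case-insensitive collation.
conn.execute("CREATE INDEX idx_city ON records (city COLLATE NOCASE)")

def ids_in_city(conn, city):
    """Case-insensitive (ASCII) exact match on the city column."""
    cur = conn.execute(
        "SELECT id FROM records WHERE city = ? COLLATE NOCASE ORDER BY id",
        (city,),
    )
    return [row[0] for row in cur]

berlin_ids = ids_in_city(conn, "BERLIN")
```

Whether this beats a prepared text copy depends on the data size and on how often the same data is searched.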

Treczoks@lemmy.world · 17 days ago

The killer collective.

Treczoks@lemmy.world · 17 days ago

We just celebrated 28 years of this development, so 1997. We have lived here since 2002.

Treczoks@lemmy.world · 18 days ago

I remember installing a fresh PC with Windows 98. During installation, I disabled some Windows bloatware (Imagine! You actually could do this!), and ended up with an unresponsive, non-Windows app blocking the system. I killed that app and removed it from the system. Keep in mind that at this point, no network connection was set up, nor had I installed any driver or program yet; this was straight from the Windows install medium.

After reboot, the app was back, and again blocking the system.

Wiping the hard disk and starting the installation over did not help either.

Turned out this was some bloatware installed by the BIOS whenever it detected at boot that there was a) a Windows installation that was b) “missing” their “register your PC with us” app. This needed some Windows bloatware to work, and thus failed on this machine.

This was the only time I angrily screamed at a hotline worker.

Treczoks@lemmy.world · 19 days ago

First of all, he should drop Python for anything as resource-intensive as such a simulation. And then think about how to optimize the algorithm.

Treczoks@lemmy.world · 23 days ago

As a kid, I found a box of dried prunes. They were so soft and tasty, I ate the complete box.

Definitely not on my list of repeats.

Treczoks@lemmy.world · 23 days ago

Well, don’t visit the US. This is just one of the many reasons to avoid that place.

Treczoks@lemmy.world · 24 days ago

“For our profits, it is reasonable not to waste money on AC.” – Your boss

Treczoks@lemmy.world · 24 days ago

Well, this would be more calling the cops on the DoorDash driver.

Treczoks@lemmy.world · 1 month ago

Yes, this works with most stickers, but there are some tough bastards that even resist that.

Treczoks@lemmy.world · 1 month ago

The compressing and renumbering seems to be more common with embedded Chinese fonts, where it makes a lot of sense space-wise. But yes, mark and copy text, paste it into Word or Writer, and you get gibberish. Can’t verify the search, though. And, of course, Google Translate can’t do anything with it, either.

Treczoks@lemmy.world · 1 month ago

If you ever need to edit a PDF that way, just use Inkscape. It is way better than LibreOffice Draw for that.

Treczoks@lemmy.world · 1 month ago

It is not a curse. It does exactly what it is intended to do: create an archive of a document that is universally reproducible.

It is a very well designed cul-de-sac for exactly this purpose. Using it for anything else is asking for trouble.

Treczoks@lemmy.world · 1 month ago

The problem lies in the PDFs themselves. They contain objects that represent lines of glyphs, if you are lucky. A conversion tool can guess which of those lines belong together and produce the text.

It cannot know any intentions behind it, though. Take a numbered list. The first line is two line objects: the number plus the . or the ), and the first line of text. The conversion tool can now guess. As the line blocks with the numbers are all left of the line blocks with text, this could be a numbered list. Or it could be a table with two columns. Nothing in the PDF is giving any hints.
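To make the guessing concrete, here is a toy heuristic of the kind a conversion tool might apply. It is only a sketch: the `LineObject` type, the coordinates and the rule itself are assumptions, not how any particular tool works. It groups line objects by baseline and looks at the leftmost object in each row.

```python
import re
from dataclasses import dataclass

@dataclass
class LineObject:
    x: float    # left edge of the run of glyphs (hypothetical units)
    y: float    # baseline; larger values are higher on the page
    text: str

# A list marker: digits followed by "." or ")".
MARKER = re.compile(r"^(\d+)[.)]$")

def guess_structure(objects, y_tolerance=1.0):
    """Guess 'numbered list', 'table' or 'unknown' for a two-column layout."""
    # Group objects that share (roughly) the same baseline into rows.
    rows = {}
    for obj in objects:
        rows.setdefault(round(obj.y / y_tolerance), []).append(obj)

    # Collect the leftmost text of every row that has at least two objects.
    left_texts = []
    for key in sorted(rows, reverse=True):  # top of the page first
        row = sorted(rows[key], key=lambda o: o.x)
        if len(row) >= 2:
            left_texts.append(row[0].text.strip())
    if not left_texts:
        return "unknown"

    numbers = []
    for text in left_texts:
        match = MARKER.match(text)
        if match is None:
            return "table"  # left column is not made of list markers
        numbers.append(int(match.group(1)))

    # Markers counting up by one look like a list; anything else, call it a table.
    consecutive = all(b == a + 1 for a, b in zip(numbers, numbers[1:]))
    return "numbered list" if consecutive else "table"

list_like = [
    LineObject(50, 700, "1."), LineObject(70, 700, "First item"),
    LineObject(50, 686, "2."), LineObject(70, 686, "Second item"),
]
table_like = [
    LineObject(50, 700, "Name"), LineObject(120, 700, "City"),
    LineObject(50, 686, "Alice"), LineObject(120, 686, "Berlin"),
]
```

Note that a real table whose first column happens to contain 1., 2., 3. would be classified as a list by this rule. That is exactly the ambiguity: the geometry is the same in both cases.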

And that is the easy part. This assumes that the document either uses default fonts, or keeps its embedded fonts untouched. If they use embedded fonts and a PDF optimizer that only embeds the used characters and renumbers them, any copy or conversion tool is bound to fail.
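A toy model of why the renumbering breaks copying. This does not use any real PDF library; `subset_font`, `naive_copy` and `copy_with_mapping` are illustrative names, and real font subsetting is far more involved. The point is only that after renumbering, the stored codes no longer say which characters they stand for unless a mapping back to characters travels with the file.

```python
def subset_font(text):
    """Toy 'optimizer': keep only the used characters and renumber them
    1, 2, 3, ... in order of first appearance."""
    char_to_code = {}
    for ch in text:
        if ch not in char_to_code:
            char_to_code[ch] = len(char_to_code) + 1
    codes = [char_to_code[ch] for ch in text]
    return codes, char_to_code

def naive_copy(codes):
    """What a copy tool produces if it treats glyph codes as character codes."""
    return "".join(chr(code) for code in codes)

def copy_with_mapping(codes, char_to_code):
    """Recovering the text is only possible with the reverse mapping."""
    code_to_char = {code: ch for ch, code in char_to_code.items()}
    return "".join(code_to_char[code] for code in codes)

codes, mapping = subset_font("hello")
garbled = naive_copy(codes)                   # control characters, not "hello"
recovered = copy_with_mapping(codes, mapping)
```

The page still renders correctly in this model, because the glyph shapes are stored under the new numbers. Only the way back to text is lost.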

Same with protected PDFs where you simply cannot copy the text from the start.

And then there are PDFs that just consist of scanned pages. Here you would need OCR software to get something readable out of them.

PDF is an archival output format, the end of a process. Not something to work from.

Always preserve the original file. Keep it safe. If you change tools, make sure you have a conversion path into something editable. The PDF is for giving away, nothing else.

Treczoks@lemmy.world · 2 months ago

And then sue them to kingdom come.

Treczoks@lemmy.world · 2 months ago

I had to design a volume-limiting system for one of our devices that uses headphones. We know that the users turn the volume up to unhealthy levels - more often than not because their hearing is already damaged from listening for years or decades to systems that had no limitation. They are still able to turn the volume up with the (analog) amplifier, but we measure the signal, and if it exceeds the legal limit, we scale it down digitally.
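The "measure, then scale down digitally" idea could look roughly like the sketch below. The actual design is not described in the comment, so everything here is an assumption: the block-wise processing, the use of a plain RMS level as the measurement, and the `max_rms` threshold standing in for the legal limit.

```python
import math

def rms(samples):
    """Root-mean-square level of a block of samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))

def limit_block(samples, max_rms):
    """Scale a block down if its RMS level exceeds max_rms; never amplify."""
    if not samples:
        return []
    level = rms(samples)
    if level <= max_rms:
        return list(samples)   # below the limit: pass through unchanged
    gain = max_rms / level     # gain < 1 brings the block down to the limit
    return [s * gain for s in samples]

loud = [0.8, -0.8, 0.8, -0.8]   # hypothetical block, RMS 0.8
quiet = [0.1, -0.1, 0.1, -0.1]  # hypothetical block, RMS 0.1
limited = limit_block(loud, max_rms=0.5)
untouched = limit_block(quiet, max_rms=0.5)
```

A real limiter would presumably smooth the gain over time and use a measurement that matches the applicable standard, rather than switching the gain per block like this.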

Treczoks@lemmy.world · 2 months ago

They had no Linux driver at all back then, but there were some rudimentary ones from the community that printed OK. They just did not support special printing modes, which I wanted to add.