HN comments for: Stirling PDF: Self-hosted, web-based PDF manipulation tool

anotherhue

15 replies

1d14h

2024-05-03 03:53:42 UTC

Originally developed entirely by ChatGPT, this locally hosted web application has evolved to encompass a comprehensive set of features, addressing all your PDF requirements.

Well that's that then.

nolongerthere

8 replies

1d13h

2024-05-03 04:47:31 UTC

so do we trust it?

wakawaka28

3 replies

1d13h

2024-05-03 05:02:09 UTC

The claim, or the software? If the software was developed by ChatGPT it probably sucks. If it wasn't developed by ChatGPT, its author is a liar. So either way, fuck it.

secfirstmd

2 replies

1d9h

2024-05-03 08:59:32 UTC

Who cares if it was developed by ChatGPT if it actually works? Why not try it out first before crapping on someone's hard Open Source work?

wakawaka28

0 replies

23h9m

2024-05-03 19:17:18 UTC

Yeah let me waste a shitload of time on possible low-quality shit that might have spyware. Based on the advertisement I'm not convinced. I want PDF tools developed by experts, not monkeys pounding on keyboards or shysters who push stupid gimmicks.

langnutty

0 replies

1d4h

2024-05-03 13:27:39 UTC

PDF software / readers are reliably the least reliable, most likely to have security issues programs on any given machine. I for one would not run C code blindly copied from ChatGPT carelessly.

In this case I think the actual PDF bits are command line tools written by humans though, and just the web wrapping (some small initial piece of it) is originally from ChatGPT.

atoav

3 replies

1d11h

2024-05-03 07:12:35 UTC

I once tried 3 hours to get ChatGPT to write a correct cubic interpolation function. Everything it wrote was either not cubic or not an interpolation function (the resulting curve didn't pass through the input points).

So yeah.

franga2000

2 replies

1d9h

2024-05-03 08:27:29 UTC

Not sure what your point is. I probably couldn't write a correct cubic interpolation function in 3 hours if you stuck me in an empty room and kept giving me the same instruction with a small change every time. But I've definitely written software as complex as this project basically from memory with only the occasional glance at some framework docs.

Leetcode is a meaningless metric when evaluating application developers.

atoav

1 replies

11h18m

2024-05-04 07:07:36 UTC

What are you talking about?

This was an example of how I gave a LLM clear instructions on a function that is known to the internet. If I told you get to use google you probably could solve this "leetcode" task in 10 minutes.

So I picked a random known algorithm and asked ChatGPT to write a function with it, just to test it.

I agree that leetcode is a useless metric, but nobody informed me that something simple like cubic interpolation belongs to this.category now. Maybe I should re-evaluate my career.

franga2000

0 replies

8h54m

2024-05-04 09:31:48 UTC

I'm just saying that writing a small function that has to be perfectly correct is a very different task from from writing a small web app where the space of possible solutions is incredibly large.

LLMs don't have perfect recall, so it has to reconstructtthe answer from the information it retained from training. Your example is like asking it math problems: unless it had both seen and retained the specific one you've asked it, it can't answer correctly because doesn't have the capability to reason about the problem and solve it. But something like writing a web app is closer to writing a story, where each next block it spits out doesn't have to be exactly correct, it just has to get the job done.

As for leetcode, I guess it depends on your field, but my logic is this: if I came to an interview and was told to write a microservice that receives a POSTed file, extracts some data and puts it in a DB, that's a solid work-relevant skill test. But if they asked to to write some interpolation functions, I'd consider that a leetcode interview - closer to math than programming, not a good indicator of job ability, not representative of the kind of work I'll be doing.

jpnc

1 replies

1d13h

2024-05-03 04:53:46 UTC

I wonder what the point of that sentence is - to get picked up by HN? Kinda like how any product or service even tangentially related to data suddenly has 'by the way also AI and stuff' added somewhere on their landing page. You don't see 'Developed using intellisense' used in READMEs.

exe34

0 replies

1d11h

2024-05-03 06:45:57 UTC

You do see "sent from my iPhone"

frooodle

1 replies

1d9h

2024-05-03 08:54:00 UTC

Creator here, I think there is some confusion in my doc. It's not made with fully with chatgpt

It was initially created as a 24 hour challenge to make a full app with chatgpt 3.0 in a set time limit to test what chatgpt was like last year.

I posted on Reddit it got lots of demand and I turned it into a full app,the only full chatgpt was the first 24 hours, it's over a year later now

neitsab

0 replies

1d9h

2024-05-03 09:21:49 UTC

Yes, it would be a great idea to update the wording as there is no way to derive that from the current one.

Even being sympathetic, my thought reading this was "probably bad code quality/rotten core despite the great feature set".

You can have a "History/Background/Origin" section where you put exactly what you wrote in your comment and it will be fine.

This notwithstanding, thank you very much for developing this app! I will look into deploying it on my server, it will be of great help to people around me who often need manipulating PDFs but are not super technical!

arthurcolle

0 replies

1d13h

2024-05-03 04:57:22 UTC

That's amazing. What a time to be alive.

BonusPlay

0 replies

1d10h

2024-05-03 08:11:36 UTC

For me it means:

1) don't expose it to public internet

2) don't give it untrusted input

Which highly reduces the usability factor for me.

maweki

11 replies

1d11h

2024-05-03 07:23:59 UTC

I host this at home. I don't use it myself, I can use the linux CLI tools that its based on. But I prefer my wife to use this to convert/split/etc. her pdf files this way instead of using some random website or app (that uses the same cli tool anyway).

She doesn't mind either way. Seems to work well enough for her use cases.

nashashmi

8 replies

1d5h

2024-05-03 13:09:17 UTC

You should just get a perpetual license to a PDF tool for $70. Best money ever spent. And get a portable exe version too.

ziddoap

7 replies

1d4h

2024-05-03 13:34:51 UTC

They seem to have a working solution that they, and their wife, is happy with. Why should they spend money on a different solution?

nashashmi

6 replies

1d4h

2024-05-03 14:13:18 UTC

Because it is two birds with one stone? Because it is one less system to maintain? Because it is superior to a web based UI with mutliple tools under different headers? because it is simple for even the least technical?

Suggestion is to use a free Pdf xchange editor by tracker software. And print the document to pdf when done.

nirav72

3 replies

20h44m

2024-05-03 21:41:43 UTC

I'm running this in a container on my nas. There isn't anything to maintain with tools like this. If setup, properly - they keep running unless stopped. They can even be auto-updated when their is a new image. I have several types of utility apps I'm self-hosting. I rarely have to do anything other than keep the OS updated.

nashashmi

1 replies

19h17m

2024-05-03 23:08:43 UTC

What nas system do you use that runs containers? I tried the whole container thing and have to reboot my system every week

happymellon

0 replies

5h3m

2024-05-04 13:23:03 UTC

You had to reboot your system every week because you tried a container?

nirav72

0 replies

3h6m

2024-05-04 15:19:38 UTC

Just plain ubuntu server and docker installed. I use Portainer to manage the containers. Container images are updated automatically using watchtower.

sodality2

1 replies

1d4h

2024-05-03 14:26:11 UTC

Counterpoint: A web service is one thing to maintain that then works on all devices, including mobile. A PDF tool needs to be installed.

nashashmi

0 replies

1d3h

2024-05-03 15:20:24 UTC

That is a good point in the world of ipad, android, and iphone. But did you know that iOS has most of these features built in? I just discovered this now.

smartmic

1 replies

1d9h

2024-05-03 09:08:21 UTC

… use the Linux CLI tools it is based on

I am interested in this part. Here is what I found: https://pdfbox.apache.org/2.0/commandline.html

Since PDFBox is a Java application, it should work cross-platform, not just Linux. Please correct me if you mean something else.

maweki

0 replies

1d8h

2024-05-03 10:24:02 UTC

I am sorry. I was under the impression that, at least for some features, Stirling PDF also uses pdfjam or similar, instead of wholly delegating to PDFBox. I think the point still stands, that there are powerful cli tools and I wouldn't consider running this container just for myself.

Probably anybody who can get this docker container running, can use appropiate open source cli tools. So one would wonder about the target audience. I don't. ;)

But I do indeed LibreOffice's command line conversion features.

emarsden

9 replies

1d12h

2024-05-03 06:07:30 UTC

From the README: “Stirling PDF does not initiate any outbound calls for record-keeping or tracking purposes”. Beyond auditing the code, how could a potential user verify this claim in advance, and how can a web-based app help support such a claim (in particular when the app does need to make some web requests to operate, but only to a restricted list of URLs that might be listed in a manifest along the lines of a Content-Security-Policy for instance)?

This is a concrete problem when deploying apps that need the user to “upload” some sensitive content.

huygens6363

2 replies

1d10h

2024-05-03 07:27:14 UTC

Little snitch[1] can help you out when self-hosting. When not self-hosting, all bets are off and my default stance is "expect the worst".

[1] https://www.obdev.at/products/littlesnitch/index.html

Edit: LS is MacOS oriented. I'm sure there are others, but I'm not into it. I feel it should be an OS-level feature, but who am I.

vladvasiliu

1 replies

1d10h

2024-05-03 08:19:59 UTC

LS is MacOS oriented

There's opensnitch on Linux. There's also something similar on Windows but I don't remember what it's called.

mathfailure

0 replies

4h41m

2024-05-04 13:44:44 UTC

TinyWall for Windows.

apexalpha

1 replies

1d9h

2024-05-03 08:45:26 UTC

Just put a sniffer or network capture tool like Wireshark in between. Additionally you could restrict the apps network access entirely to just your local home network.

emarsden

0 replies

1d8h

2024-05-03 09:43:23 UTC

It seems that there is some missing tooling to make this convenient.

You can run a local bundle of HTML/JS/WASM in a web browser instance that you isolate (for example with firejail) to prevent network access. You distribute as a zip/tgz, but it's not obvious how to handle updates without a full redownload. Distributing with a full Electron-like interface is obviously overkill.

If you're running a web app that's hosted elsewhere (which will be much more convenient for most people), your web browser or the software isolation functionality (or firewall/proxy) needs to distinguish between the initial resource loads (approve) and later sneaky logging requests (ban).

There are Android applications such as TrackerControl that have related functionality (operates as a local VPN to filter all network requests and block tracking) but I don't know of convenient tools for the desktop (Linux, in particular).

TheCapeGreek

1 replies

1d11h

2024-05-03 06:27:01 UTC

That's a problem with just about any package, library or system you use in the end.

Open source runs in a large amount of trust, and we're all complicit.

emarsden

0 replies

1d11h

2024-05-03 06:56:41 UTC

Sure, but these types of applications are running in a web browser sandbox, which benefits from enormous engineering resources to protect the host computer from malicious actions by the remote code. I'm wondering whether this execution environment (augmented with some policy mechanism to allow apps to declare their URL access needs, a little like an AppArmor or network firewal policy) could also provide some guarantees concerning privacy or information security.

justsomehnguy

0 replies

1d9h

2024-05-03 08:56:22 UTC

in particular when the app does need to make some web requests to operate

A web app doesn't need to make an outbound web requests to operate. A user interacting with a web is the one initiating the requests.

You can give the access to the up through a HTTP proxy and you can filter out any outbound requests from the web app or even not configuring the network routing for the server hosting that app. That leaves you with only JS initiated requests in the rendered pages of the app.

arcastroe

0 replies

19h53m

2024-05-03 22:32:24 UTC

If you're self-hosting on kubernetes, you can set up network policies with deny-all egress rule for this deployment/pod. This would block all outward network calls.

noodlesUK

8 replies

1d3h

2024-05-03 15:06:10 UTC

One thing I think is really lacking from the PDF ecosystem is good open-source tools around signing. PDF signatures are something that is legally important in a lot of the world with regulations like eIDAS. Unfortunately it's extremely difficult to cryptographically sign PDF documents with tools other than Adobe's and some other (often more sketchy) proprietary tools. Even if you figure out how to use stuff like LibreOffice or poppler to sign, you'll struggle to obtain certs that will validate without spending an arm and a leg.

I really hope that someone will decide to step in and become the Let's Encrypt of PDF and S/MIME certs, because that will improve public trust significantly.

JumpCrisscross

2 replies

1d2h

2024-05-03 16:08:13 UTC

you'll struggle to obtain certs that will validate

You’ll be surprised how far you can go pasting a picture of your signature in Preview.

noodlesUK

1 replies

2024-05-03 17:41:45 UTC

Absolutely, and that's what I tend to do with my documents IRL, but I think it would be really nice if we could move to a world where signing documents digitally actually meant something more than `signature.png`.

In the EU, in order to have a legal guarantee of being treated as the same as a handwritten signature in all member states, you have to meet "Qualified Electronic Signature" level, which means cryptographic signatures and the involvement of some kind of trust services provider who validates the certificate used to sign. In practice this is rare, and ordinary electronic signatures a la Preview work for most things.

yencabulator

0 replies

22h42m

2024-05-03 19:44:11 UTC

involvement of some kind of trust services provider

Marketplace at least in the US has shown that once you have this, the actual cryptography really doesn't matter. All anyone seems to care about seems to be "We are company X and have been doing this business for Y years and here's our standard operating policy. We emailed address A at time T1 and the person reading that email address used our online services to electronically 'sign' the pdf P at time T2."

Everyone trusts Adobe/Dropbox/et al to make that claim, nobody cares about certificates and what not.

perfmode

1 replies

16h11m

2024-05-04 02:14:52 UTC

does Preview in macOS not fit the bill?

noodlesUK

0 replies

4h8m

2024-05-04 14:18:05 UTC

As far as I know preview doesn’t support creating or even viewing cryptographic signatures at all.

TheJoeMan

1 replies

19h36m

2024-05-03 22:50:16 UTC

You’re really right, I asked my IT guy who’s a windows server wizard about what it would take to implement basic PKI for internal document signing and he looked at me like I had 2 heads.

FredrikSE

0 replies

10h6m

2024-05-04 08:19:34 UTC

The PKI is not hard(Windows one works well), but to keep it secure and safe could be more challenging.

wfn

0 replies

1h55m

2024-05-04 16:30:55 UTC

Completely agree. Source: I build things for a company[1][2] which is a TRA and a QTSP (eIDAS parlance).

Two references which I promise will be interesting (re: qcerts and QES tooling):

- excellent open source library for working with PDFs and digital signatures (incl. PDF ones): https://github.com/MatthiasValvekens/pyHanko

- European Commission's DSS Tool (you can submit one PDF only, don't need both original and signed one): https://ec.europa.eu/digital-building-blocks/DSS/webapp-demo...

[1]: https://www.zealid.com/en/ - you can onboard remotely for free, download your qualified certs at https://my.zealid.com/en - upload, QES sign, download PDFs (all of these free) - or use our APIs to integrate into us (get in touch with us if you'd like the latter).

[2]: opinions are my own.

petepete

6 replies

1d11h

2024-05-03 07:22:38 UTC

I'd love it if this could be integrated into Paperless. Every now and then one of my scanned documents goes in upside down and I need to rescan. Clicking rotate, maybe reordering and letting it be rescanned would be great.

moritzruth

1 replies

1d9h

2024-05-03 09:07:22 UTC

The last minor release of paperless-ngx added merging, splitting and rotating.

Source: https://docs.paperless-ngx.com/changelog/#paperless-ngx-270

petepete

0 replies

1d6h

2024-05-03 11:45:58 UTC

Ah thanks. I'd actually updated but didn't read the changelog and hadn't noticed the new functionality!

Odenwaelder

1 replies

1d9h

2024-05-03 09:09:41 UTC

The latest version of paperless-ngx can rotate documents. Check it out!

petepete

0 replies

1d6h

2024-05-03 11:46:14 UTC

Perfect timing, thank you.

nashashmi

0 replies

1d5h

2024-05-03 13:10:35 UTC

Ouch. Rotate the document in a pdf reader. And Print to pdf.

babox

0 replies

1d9h

2024-05-03 08:57:53 UTC

me tooo

kapildev

4 replies

1d3h

2024-05-03 14:42:36 UTC

On the topic of PDFs, one thing I have always wondered is why there aren't any OpenSource pdf editors that are comparable to Adobe or Foxit. Does anyone know?

tiahura

2 replies

1d3h

2024-05-03 14:51:20 UTC

Probably for the same reason there aren't any OpenSource comparables to Word, Excel, Outlook, Autocad, Quickbooks, or any other flagship productivity software

snadal

1 replies

1d3h

2024-05-03 14:58:14 UTC

Not the same thing.

- Libreoffice can edit Word and Excel files.

- There are several tools to read a PST file. Anyway… pst files are not commonly being shared with other people so the need to accessing it is uncommon unless you already use outlook.

- Autocad and Quickbooks are far less used (less people) than PDF.

I also wonder why there are no more tools to edit PDF files. Maybe licensing issues?

Edit: formatting

tiahura

0 replies

1d3h

2024-05-03 15:15:43 UTC

Not the same thing.

- Libreoffice can edit Word and Excel files.

You are correct. Just because Libreoffice can mangle a docx file doesn't make them truly comparable.

emacs can edit a .sqlite file, but they aren't comparable.

The issue is the insanely massive investment needed to build and sustain a project with millions of lines of code for something that isn't particularly sexy.

trueismywork

0 replies

10h37m

2024-05-04 07:48:35 UTC

What features are you looking for? Okular has served me well.

GlacierFox

4 replies

1d9h

2024-05-03 08:51:39 UTC

Anyone got an alternative that wasn't developed entirely in ChatGPT?

tyre

0 replies

1d5h

2024-05-03 13:13:07 UTC

The creator is in the comments, it wasn’t entirely developed with ChatGPT. The first spike in 24h was. The rest of the past year of development was human.

shiftingleft

0 replies

1d8h

2024-05-03 09:38:19 UTC

Looking at a few files, there's definitely some generated comments in there. Do you have any method to quantify how much of it is (likely) generated?

cjblomqvist

0 replies

1d8h

2024-05-03 10:05:01 UTC

Anecdotally from previous HN discussions, I believe a lot of the code is now not developed by ChatGPT anymore.

Update: As evident by the author's comment below, it's definitely not made by ChatGPT anymore (in any major way)

Alifatisk

0 replies

1d8h

2024-05-03 09:38:33 UTC

On mac, the preview app.

On Windows, okular. But honestly, pirating Acrobat is the best way if you have a tough economy.

n3storm

3 replies

1d12h

2024-05-03 05:44:21 UTC

I hope they will add digital signing with certificates capabilities.

exe34

1 replies

1d11h

2024-05-03 06:45:14 UTC

I guess you could reverse proxy with nginx?

atoav

0 replies

1d11h

2024-05-03 07:14:55 UTC

As you probably should?

nip

0 replies

1d11h

2024-05-03 06:48:03 UTC

Out of curiosity (and self interest [1]), what is your use case for digital signing and verification?

My understanding is that it’s more about trust (Docusign being the leader) than anything else: one can provide certificate signing and verification, but the trust in the owner of the certificate is the crux of the matter

[1] I’m the developer behind SimplePDF.eu

TechDebtDevin

3 replies

1d12h

2024-05-03 05:27:25 UTC

I feel like everytime I log onto this site I encounter a finished project that's very similar to something I'm working on. I'm too slow (or project hop too much).

Not that PDF related tools are uncommon but yeah I think people understand the sentiment.

I'm also very surprised that <redacted> for profit companies in the Document-manipulating/signing/storing still exist outside of niche industries (healthcare, govt, law) that require audit-trails and other regulatory specifics. I guess SEO still rules.. If anyone wants to make some money call up all the biggest real estate firms in your area and ask them how much they spend on contract signing or related services (it's a lot) and then offer them to do this for half the amount ( I can sign 20-30k documents for <$100 a month, and could be cheaper probably) Your average real estate firm is paying .50c-$1 a signature if they are uninformed, there's a lot of the uninformed.

jiriro

1 replies

1d2h

2024-05-03 15:44:07 UTC

Could you please elaborate how do you sign 10k documents for $100/month?

TechDebtDevin

0 replies

15h56m

2024-05-04 02:30:10 UTC

Depends on the size of the documents but it's very straightforward. As someone mentioned it's reputation that is the expensive part.

cess11

0 replies

1d10h

2024-05-03 08:04:12 UTC

Manipulation and signing is the easy part. Robust storage, retrieval and deletion aren't so easy.

Reputation often matters more than price in this area, because the pricey services amount to peanuts compared to the business as a whole. It's like doing price optimisation on toilet paper in the office. And reputation is generally interpreted as a proxy for reliability and a guarantee that there will be someone to sue if things go bad.

nikisweeting

2 replies

11h45m

2024-05-04 06:41:21 UTC

I have some questions about the Github Star history, it's very unusual to see a ~1 year old repo with 20k+ stars.

It went from 6k to 15k+ stars in a few days around 2023 Christmas when HN/Github/Reddit traffic is usually lowest, and I didn't see a corresponding social media post or announcement around that time with that kind of traffic.

If I'm wrong and there is some big social media post / promo that I missed, I apologize, I'll eat my shorts!

https://star-history.com/#Stirling-Tools/Stirling-PDF&Date

https://www.google.com/search?q=%22stirling%22+%22PDF%22&sca...

frooodle

0 replies

3h22m

2024-05-04 15:04:08 UTC

Creator here, there was a big HN post that caused that big boom during Christmas

Froodle

0 replies

27m

2024-05-04 17:58:40 UTC

https://news.ycombinator.com/item?id=38765627

haidev

2 replies

1d13h

2024-05-03 05:18:47 UTC

How is this entirely developed by ChatGPT? Does the author claim that all the code in there was generated by ChatGPT?

frooodle

1 replies

1d9h

2024-05-03 08:53:30 UTC

Creator here, I think there is some confusion in my doc. It's not made with fully with chatgpt

It was initially created as a 24 hour challenge to make a full app with chatgpt 3.0 in a set time limit to test what chatgpt was like last year.

I posted on Reddit it got lots of demand and I turned it into a full app,the only full chatgpt was the first 24 hours, it's over a year later now

figmert

0 replies

1d7h

2024-05-03 11:07:59 UTC

I'd love to know how you did that initial 24 hour challenge. Maybe a blog post or something? If I were to go about creating a start to an application using ChatGPT, how would I go about it?

sidcool

1 replies

1d11h

2024-05-03 07:03:40 UTC

The test coverage is pretty low. I'm raising a PR for this. Code quality not the best.

exac

0 replies

1d10h

2024-05-03 07:38:06 UTC

Well it was developed initially by ChatGPT. First file I open I see repeated comments.

https://github.com/Stirling-Tools/Stirling-PDF/blob/7f577a60...

showerst

1 replies

1d3h

2024-05-03 14:50:55 UTC

On a related note, does anyone have a good solution for highlighting arbitrary spans of text in PDFs? Trying to make viewing search results easier, but most of the solutions I've found are pretty lousy.

Jackson_Fleck

0 replies

18h27m

2024-05-03 23:59:22 UTC

I think you can hack together something in pdf.js but you have to deal with the pain of digging through its code. I’m working on something of the sort in an application I’ve built but it’s v much “nice to have”.

If you’re talking about raw pdfs then you are at the whim of the encoding surely? I’ve always found Adobe etc to have utterly crap searches

mathfailure

1 replies

4h35m

2024-05-04 13:50:24 UTC

I once used it on a pdf file with a scanned text form trying to make it more contrast, as the scan was hardly seen. What I needed was basically just making dark stuff darker (up to the point of making it black) and maybe a bit thicker to make it more visible.

The tool failed to help me with such a seemingly menial task, the improvement was very small. I even tried to repeat the step multiple times, but after like 2nd use there were no visual differences anymore (but the file's size kept actually changing).

jacob019

0 replies

4h0m

2024-05-04 14:26:06 UTC

That's not really a PDF thing. In that case the PDF is a thin wrapper around raster images. Extract the images, increase the contrast with gimp or imagemagick, and make a new PDF. You could script this if you had a lot of them.

Beijinger

1 replies

1d11h

2024-05-03 07:18:18 UTC

At least it is open source. It never hurts to have something web based but I prefer an application. I use a commercial product for Linux. Master PDF Editor. It is good, but their copy protection sucks. Better don't forget to "deregister" ist before wiping your harddrive, otherwise your code won't work. But there is always customer service....

franga2000

0 replies

1d9h

2024-05-03 08:34:03 UTC

I generally agree, but I also very often need to quickly do something basic to a PDF from my phone (like split a 2-column layout to single-column so it's actually readable) or when doing something on someone else's computer and little web tools like this are great for that.

And yes, Master PDF Editor is an amazing piece of software! It makes creating PDF forms so easy that every time I get a PDF to fill out, I make it a form and send back an empty copy too so whoever sent it to me can use that instead. I've gotten a few smaller organisations to start using mine instead.

thrdbndndn

0 replies

1d11h

2024-05-03 07:14:47 UTC

I see it uses pdfbox and pdf.js, so I assume it's mainly a frond end of these tools?

tacocataco

0 replies

1d3h

2024-05-03 15:18:18 UTC

One of firefox's more recent updates added a PDF editor.

I think people's perception of forefox is from several versons ago. As a daily user throughout its history, Firefox has made alot of progress over the years IMO.

Give it another shot if it's been a while.

siva7

0 replies

1d9h

2024-05-03 09:02:53 UTC

Originally developed entirely by ChatGPT

I would be careful with such wording as one could easily come to the conclusion that this tool was developed by the ChatGPT team. Nevertheless that this software certainly wasn't entirely developed by ChatGPT which is technically not possible but WITH the assistance of an AI tool.

mstijak

0 replies

1d11h

2024-05-03 06:34:41 UTC

Working on a product that could be similarily described.

CxReports: Self-hosted, web-based PDF reporting tool.

https://www.cx-reports.com

llagerlof

0 replies

1d13h

2024-05-03 05:11:42 UTC

When I saw this project months ago I instant loved it. I installed and start using it for real and... I found many bugs. Some tools were unusable.

I really hope it's better now.

junto

0 replies

22h40m

2024-05-03 19:45:40 UTC

I’m looking at the list and trying to figure out if there is an API to extract the text from a PDF? Or retrieve it as vectors / embeddings?

bustedagain

0 replies

1d1h

2024-05-03 16:56:14 UTC

Can someone convert this mumble jumble of docker api endpoint locally environment into a working software that people can actually install and use on their pc/mac/phone???

This seems too complicated to perform simple tasks of split merge edit not to mention the GBs of space docker and dependencies will take.

Thank you

brnt

0 replies

5h55m

2024-05-04 12:30:59 UTC

Now do a wasm version.

Jackson_Fleck

0 replies

23h38m

2024-05-03 18:48:00 UTC

I’m not sure if this uses unoconv with LibreOffice but my heart goes out to anyone trying to develop Pdf/document manipulation tools with the current Python libraries. A thankless task.

The more you work on this stuff the more you hate proprietary formats as well as having to rely on open source repos operated at the whim of a few good people.

2Gkashmiri

0 replies

1d1h

2024-05-03 16:42:37 UTC

Is there a PDF templating tool that lets you GUI modify elements on a PDF to be used to generate reports and invoices for example ?

Backend has Python and preferably agpl