- I am trying to do something along these lines
- What I am looking for is "informationally dense" articles across a blog
- For example, posts like "AI is going to do this" and "how I felt at my company when they adopted AI" are pure opinion.
- On the other hand, a post like "here are 10 ways to loop a directory in bash" is informationally dense
- What sort of techniques / algorithms do you think I could use to narrow it down? I can think of removing stop words from the post and counting the ratio of remaining words to total words (not sure if that means anything), or maybe n-gram analysis, but I am really not an expert at this
- Perhaps someone on HN can shed some light on how to go about identifying "information rich" articles on a blog
- Do you think LLMs would do a good job if we were to loop through every post on a blog and ask them to pick the non-opinion ones?
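
For what it's worth, the stop-word idea above can be sketched in a few lines of Python. This is only a rough illustration: the stop-word list is a tiny hand-picked sample (a real list from an NLP library would be much longer), and the function names `content_word_ratio` and `rank_posts` are made up for this example. The ratio is a weak signal on its own, so treat it as a first filter rather than a classifier.

```python
import re

# Tiny illustrative stop-word list (an assumption for this sketch);
# real lists contain a few hundred entries.
STOP_WORDS = frozenset("""
a an the and or but if then of to in on at for with by from as
is are was were be been i me my we our you your he she it they them
this that these those do does did have has had will would can could
should not so very just
""".split())


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def content_word_ratio(text):
    """Fraction of tokens that are not stop words (0.0 for empty text)."""
    tokens = tokenize(text)
    if not tokens:
        return 0.0
    content = [t for t in tokens if t not in STOP_WORDS]
    return len(content) / len(tokens)


def rank_posts(posts):
    """Given {title: body}, return titles sorted from highest ratio to lowest."""
    return sorted(posts, key=lambda title: content_word_ratio(posts[title]),
                  reverse=True)
```

Running `rank_posts` over a blog's posts gives a crude ordering to eyeball; the top of the list could then be handed to an LLM for the opinion / non-opinion call, which keeps the number of LLM requests down.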
My workflow is a little simpler: I just print to PDF and try to keep the page count a multiple of 4. Adobe Reader has a nice Booklet print setting that puts 4 PDF pages on each sheet of printer paper. After printing, just fold the stack in half.
Access to KDP and time to do this vary.
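
The multiple-of-4 rule in the workflow above comes from each folded sheet carrying four pages, two on each side. Adobe Reader's Booklet setting works out the layout itself, so the sketch below is only to show the arithmetic; the function names are hypothetical, and the ordering is the usual fold-in-half (saddle-stitch) arrangement rather than anything taken from Adobe's implementation.

```python
def blank_pages_needed(page_count):
    """Blank pages to append so the total is a multiple of 4."""
    return (-page_count) % 4


def booklet_sheets(page_count):
    """Return one ((front_left, front_right), (back_left, back_right)) entry per sheet.

    Page numbers are 1-based. Numbers above page_count refer to the
    blank padding pages added to reach a multiple of 4.
    """
    total = page_count + blank_pages_needed(page_count)
    sheets = []
    for i in range(total // 4):
        front = (total - 2 * i, 2 * i + 1)      # outer side of the folded sheet
        back = (2 * i + 2, total - 2 * i - 1)   # inner side of the folded sheet
        sheets.append((front, back))
    return sheets
```

For an 8-page PDF, `booklet_sheets(8)` gives two sheets: pages 8 and 1 on the front of the first with 2 and 7 on its back, then 6 and 3 with 4 and 5 on the second. A 10-page PDF needs 2 blank pages to fill out a third sheet.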