Deutsch   English   Français   Italiano  
<vkf98h$21dsg$1@dont-email.me>

View for Bookmarking (what is this?)
Look up another Usenet article

Path: ...!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: Don Y <blockedofcourse@foo.invalid>
Newsgroups: sci.electronics.design
Subject: Re: pdf page counting
Date: Tue, 24 Dec 2024 14:32:00 -0700
Organization: A noiseless patient Spider
Lines: 71
Message-ID: <vkf98h$21dsg$1@dont-email.me>
References: <vc2hmjhc9qcfnaa054ej50ii65k3u4r8lj@4ax.com>
 <vka38r$qekf$1@dont-email.me> <72ahmjhc9qcfnaa054ej50ii65k3u4r8sh@4ax.com>
 <vkacja$sa01$1@dont-email.me> <gs3jmjpioj9d2pn95qji9jd23cvd3b5l01@4ax.com>
 <vkcar9$1b1m7$1@dont-email.me> <17dlmj9g00pa728rellhc8cknj0i6cgk22@4ax.com>
 <vkf268$20d9a$1@dont-email.me> <vkf4ab$20nml$1@dont-email.me>
 <lt0jf4F4bieU1@mid.individual.net>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 8bit
Injection-Date: Tue, 24 Dec 2024 22:32:02 +0100 (CET)
Injection-Info: dont-email.me; posting-host="6de4f85a2ea0348558f2e6ef8f4c4ee4";
	logging-data="2144144"; mail-complaints-to="abuse@eternal-september.org";	posting-account="U2FsdGVkX18wjn9BPqcwGMaOTxiS5NBM"
User-Agent: Mozilla/5.0 (Windows NT 6.1; Win64; x64; rv:102.0) Gecko/20100101
 Thunderbird/102.2.2
Cancel-Lock: sha1:yHZx0eJgWojvEGhlPmTk8UrYIk4=
Content-Language: en-US
In-Reply-To: <lt0jf4F4bieU1@mid.individual.net>
Bytes: 5109

On 12/24/2024 1:21 PM, Brian Gregory wrote:
> On 24/12/2024 20:07, Don Y wrote:
>> On 12/24/2024 12:31 PM, Don Y wrote:
>>> For folks thinking PDFs are just "electronic books", that's likely
>>> an acceptable tradeoff.  OTOH, if you want to explain the difference
>>> between the different speaking accents of New Yorkers, Bostonians,
>>> Chicagoans, etc., it's much easier to embed three audio clips
>>> and let the "reader" HEAR them instead of trying to describe them
>>> textually.  Similarly, instead of publishing N different views of
>>> a 3D object to give the reader an idea of how it is constructed,
>>> it's much easier to embed a 3D model IN the document and let the
>>> reader explore it in whatever manner HE deems appropriate.
>>>
>>> These are things that you can't do with paper books.
>>
>> Well, technically, you can distribute other *media* with a paper
>> book that addresses some of these issues.  E.g., I have a book
>> titled _Mouth Sounds_ that includes a (flimsy) "45" to be played
>> on a phonograph to "hear" the sounds described.  Others have
>> included CDs affixed to back covers, etc.
>>
> 
> I'm not completely sure you are fully understanding what the OP wants. I think 
> he has the page numbers showing correctly in the headers of all his pages but 

The page numbers are present in the footers of the SCANNED pages.  As they
are just "text" -- indistinguishable from any OTHER text on those pages
(i.e., they aren't TAGGED as "page numbers" but, rather, are recognized by
human readers as LIKELY being page numbers owing to their consistent
placement on the pages)

When creating a document that you want to eventually export AS a PDF,
you would typically *tag* the page number (possibly including any prefixed
and suffixed text like "Page " ## "of N").  So, any processing done by
the DTP tool (like creating an index, table of contents, list of illustrations,
list of tables, etc.) would know the *apparent* (as in "visible") page number
present on the particular page.

Similarly, one would "tag" section titles so that the tool could build a list
of section names (e.g., for the table of contents).

And, one would EMBED tags for index entries at arbitrary points in the
document.  So, I could embed a reference to this text <INDEX:tag example>
and whichever page it happened to be rendered on would be indicated adjacent
to an entry titled "tag example".

When you scan a document, its hard to infer the structure of the document
from the mono/multi-chrome images that the scanner "sees".

But, the scanner can definitely note that it is now processing the "fourth"
page (even if a human would see the page as marked as "ii").

> wants to have the reader itself know the page numbers he used in his headings 
> so that he can tell the reader to go to page 17 and it goes to the page with 
> "17" in the header and shows page 17 / whatever as it's status.

That's exactly what the doctored PDF that I provided does (at least using
Adobe's products).  I can type "II" (roman numeral 2) in the page number
box and it will move to the page bearing the LABEL "II" -- which happens
to be the tenth page in the document.

Similarly, I can type in "322" and end up on the page that is LABELED as
"322" (which is the 336th in the document) carrying the heading
"RECEIVING AND GRADING HONEY".

I've not experimented with how DUPLICATE "labels" are handled.  E.g.,
if there is another page LABELED as "322", will a search find it, too?
Or, will the search facility only find the *next* instance of "322"
(so, depending on where you are in the document, you might find the
first "322" or the second... or neither!)