GNU bug report logs -
#36431
Crash in marker.c:337
Previous Next
Reported by: Werner LEMBERG <wl <at> gnu.org>
Date: Sat, 29 Jun 2019 11:19:01 UTC
Severity: normal
Done: Stefan Monnier <monnier <at> iro.umontreal.ca>
Bug is archived. No further changes may be made.
Full log
Message #62 received at 36431 <at> debbugs.gnu.org (full text, mbox):
> From: Stefan Monnier <monnier <at> iro.umontreal.ca>
> Cc: wl <at> gnu.org, 36431 <at> debbugs.gnu.org
> Date: Tue, 02 Jul 2019 13:51:30 -0400
>
> - we insert the new bytes at the beginning of the gap, in order to have
> room to grow if there are more bytes than expected, and also in case
> there are fewer bytes than expected (in which case we'd otherwise
> have to move the bytes we just read so they properly end at the end
> of the gap).
Also, you will see in insert-file-contents that it supports quitting
while reading a huge file, and also the REPLACE argument, where we
detect the same contents at beginning and end of the file and the
buffer.
> - decode_coding_gap wants the new input bytes to be at the end of the
> gap, so that we can put the decoded chars at the beginning of the gap
> and as one grows the other shrinks, so we don't need space for "IN +
> OUT" bytes but only for "OUT" bytes. Is that right (I'm trying to
> find some comment or other evidence that this is the case, but
> haven't found it yet).
That's right. The comment you are looking for (well, at least part of
it) is in the commentary before decode_coding, where it explains the
semantics of CODING->src_pos. You will see at the beginning of
decode_coding_gap how it sets things up according to that hairy
protocol.
> IOW, it should be possible to optimize the common case by reading the
> new bytes into the end of the gap to avoid moving everything in the
> common case (if the number of bytes read is different from originally
> expected, we'll have to do extra work, but for the common case where we
> know the file size upfront and it doesn't change while we read it, this
> will save us some work).
>
> But the effort is probably not worth the trouble: a memmove of a few
> gigabytes costs relatively little compared to the cost of actually
> decoding those same gigabytes.
Right. Also, there are the other subtle issues with quitting, the
REPLACE argument, special files, etc.
This bug report was last modified 5 years and 312 days ago.
Previous Next
GNU bug tracking system
Copyright (C) 1999 Darren O. Benham,
1997,2003 nCipher Corporation Ltd,
1994-97 Ian Jackson.