[elinks-users] unicode conversion
Rick R
rick.richardson at gmail.com
Fri Jan 25 08:18:19 MST 2008
That works like a champ, thanks.
One more minor thing.
When converting French, the é, è and à are being converted to *.
(hex codes E9, E8 and E0)
Is there something I can do to fix this?
A test string: "Le phénomène à souligner est certainement la
croissance du prix moyen de vente par rapport à T3 2006 et par rapport
à T2 2007",
The result: "Le ph*nom*ne * souligner est certainement la
croissance du prix moyen de vente par rapport * T3 2006 et par rapport
* T2 2007"
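
For reference, here is a minimal C sketch of what I would expect to happen
to those three bytes, assuming the input really is ISO-8859-1 and the output
is meant to be UTF-8. The helper name latin1_to_utf8 is hypothetical; it is
not an ELinks function, only an illustration of the byte values involved.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper, not part of ELinks: encode one ISO-8859-1 byte
 * as UTF-8. Writes 1 or 2 bytes to out and returns how many were written. */
size_t latin1_to_utf8(unsigned char byte, unsigned char out[2])
{
	assert(out != NULL);

	if (byte < 0x80) {
		/* ASCII is identical in both encodings. */
		out[0] = byte;
		return 1;
	}

	/* U+0080..U+00FF need two bytes: 110xxxxx 10xxxxxx. */
	out[0] = (unsigned char)(0xC0 | (byte >> 6));
	out[1] = (unsigned char)(0x80 | (byte & 0x3F));
	return 2;
}
```

Under that assumption, E9 (é) should come out as C3 A9, E8 (è) as C3 A8 and
E0 (à) as C3 A0, rather than as a single "*".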
Thanks
On Jan 25, 2008 2:59 AM, Kalle Olavi Niemitalo <kon at iki.fi> wrote:
> "Rick R" <rick.richardson at gmail.com> writes:
>
> > I've found out the core of the symptom.
> > The FDBFBFBFBFBD is the result of the value UCS_NO_CHAR (0xFFFFFFFD)
> > being munged by the function encode_utf8
> >
> > I'm guessing that the encode function is being called during the
> > write-file routine, my question is why that UCS_NO_CHAR is included at
> > all, when the appropriate character is being passed in just before it.
>
> "This is also used as a placeholder for the second cell of a
> double-cell character." However, ELinks should filter those
> placeholders out when dumping. This has been fixed in ELinks
> 0.13.GIT with the following patch, which I'm now going to apply
> to 0.12.GIT too.
>
> Fix -dump with multicell characters
>
> ---
> commit e0f0112de9cf81f3f387ce12d78d50ecd2547e97
> tree 293eec3dd56b3f2b423d1c160342181d7f68c41a
> parent e1f0c10926f4382f8f0cf99cb99221c52742fa4c
> author Miciah Dashiel Butler Masters <miciah at myrealbox.com> Sat, 01 Sep 2007 17:04:58 +0000
> committer Miciah Dashiel Butler Masters <miciah at FroodyComp.localdomain> Sat, 01 Sep 2007 17:04:58 +0000
>
> src/viewer/dump/dump.c | 3 +++
> 1 files changed, 3 insertions(+), 0 deletions(-)
>
> diff --git a/src/viewer/dump/dump.c b/src/viewer/dump/dump.c
> index 123e29c..0961fad 100644
> --- a/src/viewer/dump/dump.c
> +++ b/src/viewer/dump/dump.c
> @@ -918,6 +918,9 @@ utf8:
>  		if (write_char(*utf8_buf++,
>  			       fd, buf, &bptr)) goto fail;
>  	}
> +
> +	x += unicode_to_cell(c) - 1;
> +
>  	continue;
>  }
>
>
> _______________________________________________
> elinks-users mailing list
> elinks-users at linuxfromscratch.org
> http://linuxfromscratch.org/mailman/listinfo/elinks-users
>
>
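
As I understand the quoted patch, the idea is that after writing a
double-cell character the dump loop advances x past the placeholder cell
that follows it. A small standalone sketch of that idea, not the actual
dump.c code: cell_width is a simplified stand-in for unicode_to_cell, and
dump_row is a hypothetical name.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define UCS_NO_CHAR 0xFFFFFFFDu /* value quoted earlier in this thread */

/* Simplified stand-in for unicode_to_cell(): 2 cells for one range of
 * wide CJK characters, 1 otherwise. The real function covers far more. */
static int cell_width(uint32_t c)
{
	return (c >= 0x4E00 && c <= 0x9FFF) ? 2 : 1;
}

/* Copy the characters of one screen row into out, stepping over the
 * placeholder cell that follows each double-cell character.
 * out must have room for ncells entries. Returns the count written. */
size_t dump_row(const uint32_t *cells, size_t ncells, uint32_t *out)
{
	size_t n = 0;

	assert(cells != NULL && out != NULL);

	for (size_t x = 0; x < ncells; x++) {
		uint32_t c = cells[x];

		out[n++] = c;
		/* Same idea as the patch: skip the extra cell(s). */
		x += (size_t)cell_width(c) - 1;
	}
	return n;
}
```

With a row such as { 'a', 0x4E2D, UCS_NO_CHAR, 'b' }, only three
characters are copied and the placeholder never reaches the UTF-8 encoder.
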
--
"Myths and legends die hard in America. We love them for the extra
dimension they provide, the illusion of near-infinite possibility to
erase the narrow confines of most men's reality. Weird heroes and
mould-breaking champions exist as living proof to those who need it
that the tyranny of 'the rat race' is not yet final." -- Hunter S.
Thompson