| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
Mew has a mechanism for determining the character set of the transfer form for both singlepart and multipart.
<Singlepart>
When you type ‘C-cC-m’ or ‘C-cC-c’ to compose a message in Draft mode, Mew decides the character set of the transfer based on the internal representation of its body. On Bilingual Emacs, US-ASCII is chosen for 7bit character sets while ISO-8859-1 is selected for 8bit character sets. On Internationalized Emacs, the character set of the transfer form is chosen based on rules defined by Mew.
<Multipart>
Since the data to be attached as a part of multipart is a file, it is stored on disk. Therefore, in order to determine its character set for the transfer form, it is necessary to load the file into an Emacs buffer converting it into internal representation. After that, Mew determines the character set of the transfer form for the file by the same method as singlepart.
On Bilingual Emacs, Mew reads a file as is. So if the file is 7bit, US-ASCII is chosen. Otherwise ISO-8859-1 is selected.
On Internationalized Emacs, Mew reads a file according to the local convention (i.e. auto conversion). The command to set a local convention is ‘C-x RET l’.
For example, in Japan, ISO-2022-JP, EUC-JP, and Shift_JIS is readily guessed and stored in buffer as internal representations for Japanese. Mew chooses ISO-2022-JP as the character set of the transfer form from the internal representation. That is, even if the character set of the file is EUC-JP or Shift_JIS, it is automatically converted into ISO-2022-JP, which is the transfer form for Japanese. This means that you can attach a file without needing to pay attention to its character set.
If you want to specify the coding-system of a file to be attached, type ‘I’. Let's call the character set "input character set". Also, if you want to specify the coding-system of the transfer form, use ‘C’.
Information regarding the character set is displayed in parentheses. If the character set of the transfer form is specified explicitly, it is displayed. Otherwise, if the input character set is specified, it is displayed with "*". Otherwise, "guess" is displayed.
Let's look at the following example. Since part 1 is a body, it is stored in an Emacs buffer. Because "guess" is displayed, it is Mew that determines the character set of the transfer form according to the rules that Mew defines.
Since iso-8859-1 is specified as the input character set, Mew loads the file considering that its character set is iso-8859-1, and then converts it into internal representation. The character set of the transfer form is decided according to rules defined by Mew.
The input character set in part 3 cannot be identified just from this example. (But a user certainly knows what it is since he actually specified it.) In any case, a file will be loaded and be converted into internal representation, then be converted into EUC-JP which is specified as the character set of the transfer form.
----------------------------- attachments -----------------------------
Multipart/Mixed 1/
1 Text/Plain(guess) CoverPage*
2 Text/Plain(*iso-8859-1) text1
B 3 Text/Plain(euc-jp) text2
4 .
--------0-1-2-3-4-5-6-7-8-9--------------------------------------------
|
Note that both ‘C’ and ‘I’ are not available on Bilingual Emacs.
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] |
This document was generated by U-X61T\saito on May, 13 2008 using texi2html 1.78.