Context Navigation

binary.n @ 42

Last change on this file since 42 was 25, checked in by landauf, 17 years ago
added tcl to libs
File size: 30.9 KB

Line
1	'\"
2	'\" Copyright (c) 1997 by Sun Microsystems, Inc.
3	'\"
4	'\" See the file "license.terms" for information on usage and redistribution
5	'\" of this file, and for a DISCLAIMER OF ALL WARRANTIES.
6	'\"
7	'\" RCS: @(#) $Id: binary.n,v 1.38 2008/01/02 21:21:37 dkf Exp $
8	'\"
9	.so man.macros
10	.TH binary n 8.0 Tcl "Tcl Built-In Commands"
11	.BS
12	'\" Note: do not modify the .SH NAME line immediately below!
13	.SH NAME
14	binary \- Insert and extract fields from binary strings
15	.SH SYNOPSIS
16	\fBbinary format \fIformatString \fR?\fIarg arg ...\fR?
17	.br
18	\fBbinary scan \fIstring formatString \fR?\fIvarName varName ...\fR?
19	.BE
20	.SH DESCRIPTION
21	.PP
22	This command provides facilities for manipulating binary data. The
23	first form, \fBbinary format\fR, creates a binary string from normal
24	Tcl values. For example, given the values 16 and 22, on a 32-bit
25	architecture, it might produce an 8-byte binary string consisting of
26	two 4-byte integers, one for each of the numbers. The second form of
27	the command, \fBbinary scan\fR, does the opposite: it extracts data
28	from a binary string and returns it as ordinary Tcl string values.
29	.SH "BINARY FORMAT"
30	.PP
31	The \fBbinary format\fR command generates a binary string whose layout
32	is specified by the \fIformatString\fR and whose contents come from
33	the additional arguments. The resulting binary value is returned.
34	.PP
35	The \fIformatString\fR consists of a sequence of zero or more field
36	specifiers separated by zero or more spaces. Each field specifier is
37	a single type character followed by an optional flag character followed
38	by an optional numeric \fIcount\fR.
39	Most field specifiers consume one argument to obtain the value to be
40	formatted. The type character specifies how the value is to be
41	formatted. The \fIcount\fR typically indicates how many items of the
42	specified type are taken from the value. If present, the \fIcount\fR
43	is a non-negative decimal integer or \fB*\fR, which normally indicates
44	that all of the items in the value are to be used. If the number of
45	arguments does not match the number of fields in the format string
46	that consume arguments, then an error is generated. The flag character
47	is ignored for for \fBbinary format\fR.
48	.PP
49	Here is a small example to clarify the relation between the field
50	specifiers and the arguments:
51	.CS
52	\fBbinary format\fR d3d {1.0 2.0 3.0 4.0} 0.1
53	.CE
54	.PP
55	The first argument is a list of four numbers, but because of the count
56	of 3 for the associated field specifier, only the first three will be
57	used. The second argument is associated with the second field
58	specifier. The resulting binary string contains the four numbers 1.0,
59	2.0, 3.0 and 0.1.
60	.PP
61	Each type-count pair moves an imaginary cursor through the binary
62	data, storing bytes at the current position and advancing the cursor
63	to just after the last byte stored. The cursor is initially at
64	position 0 at the beginning of the data. The type may be any one of
65	the following characters:
66	.IP \fBa\fR 5
67	Stores a byte string of length \fIcount\fR in the output string.
68	Every character is taken as modulo 256 (i.e. the low byte of every
69	character is used, and the high byte discarded) so when storing
70	character strings not wholly expressible using the characters \eu0000-\eu00ff,
71	the \fBencoding convertto\fR command should be used first to change
72	the string into an external representation
73	if this truncation is not desired (i.e. if the characters are
74	not part of the ISO 8859\-1 character set.)
75	If \fIarg\fR has fewer than \fIcount\fR bytes, then additional zero
76	bytes are used to pad out the field. If \fIarg\fR is longer than the
77	specified length, the extra characters will be ignored. If
78	\fIcount\fR is \fB*\fR, then all of the bytes in \fIarg\fR will be
79	formatted. If \fIcount\fR is omitted, then one character will be
80	formatted. For example,
81	.RS
82	.CS
83	\fBbinary format\fR a7a*a alpha bravo charlie
84	.CE
85	will return a string equivalent to \fBalpha\e000\e000bravoc\fR,
86	.CS
87	\fBbinary format\fR a* [encoding convertto utf-8 \eu20ac]
88	.CE
89	will return a string equivalent to \fB\e342\e202\e254\fR (which is the
90	UTF-8 byte sequence for a Euro-currency character) and
91	.CS
92	\fBbinary format\fR a* [encoding convertto iso8859-15 \eu20ac]
93	.CE
94	will return a string equivalent to \fB\e244\fR (which is the ISO
95	8859\-15 byte sequence for a Euro-currency character). Contrast these
96	last two with:
97	.CS
98	\fBbinary format\fR a* \eu20ac
99	.CE
100	which returns a string equivalent to \fB\e254\fR (i.e. \fB\exac\fR) by
101	truncating the high-bits of the character, and which is probably not
102	what is desired.
103	.RE
104	.IP \fBA\fR 5
105	This form is the same as \fBa\fR except that spaces are used for
106	padding instead of nulls. For example,
107	.RS
108	.CS
109	\fBbinary format\fR A6A*A alpha bravo charlie
110	.CE
111	will return \fBalpha bravoc\fR.
112	.RE
113	.IP \fBb\fR 5
114	Stores a string of \fIcount\fR binary digits in low-to-high order
115	within each byte in the output string. \fIArg\fR must contain a
116	sequence of \fB1\fR and \fB0\fR characters. The resulting bytes are
117	emitted in first to last order with the bits being formatted in
118	low-to-high order within each byte. If \fIarg\fR has fewer than
119	\fIcount\fR digits, then zeros will be used for the remaining bits.
120	If \fIarg\fR has more than the specified number of digits, the extra
121	digits will be ignored. If \fIcount\fR is \fB*\fR, then all of the
122	digits in \fIarg\fR will be formatted. If \fIcount\fR is omitted,
123	then one digit will be formatted. If the number of bits formatted
124	does not end at a byte boundary, the remaining bits of the last byte
125	will be zeros. For example,
126	.RS
127	.CS
128	\fBbinary format\fR b5b* 11100 111000011010
129	.CE
130	will return a string equivalent to \fB\ex07\ex87\ex05\fR.
131	.RE
132	.IP \fBB\fR 5
133	This form is the same as \fBb\fR except that the bits are stored in
134	high-to-low order within each byte. For example,
135	.RS
136	.CS
137	\fBbinary format\fR B5B* 11100 111000011010
138	.CE
139	will return a string equivalent to \fB\exe0\exe1\exa0\fR.
140	.RE
141	.IP \fBH\fR 5
142	Stores a string of \fIcount\fR hexadecimal digits in high-to-low
143	within each byte in the output string. \fIArg\fR must contain a
144	sequence of characters in the set
145	.QW 0123456789abcdefABCDEF .
146	The resulting bytes are emitted in first to last order with the hex digits
147	being formatted in high-to-low order within each byte. If \fIarg\fR
148	has fewer than \fIcount\fR digits, then zeros will be used for the
149	remaining digits. If \fIarg\fR has more than the specified number of
150	digits, the extra digits will be ignored. If \fIcount\fR is
151	\fB*\fR, then all of the digits in \fIarg\fR will be formatted. If
152	\fIcount\fR is omitted, then one digit will be formatted. If the
153	number of digits formatted does not end at a byte boundary, the
154	remaining bits of the last byte will be zeros. For example,
155	.RS
156	.CS
157	\fBbinary format\fR H3H*H2 ab DEF 987
158	.CE
159	will return a string equivalent to \fB\exab\ex00\exde\exf0\ex98\fR.
160	.RE
161	.IP \fBh\fR 5
162	This form is the same as \fBH\fR except that the digits are stored in
163	low-to-high order within each byte. This is seldom required. For example,
164	.RS
165	.CS
166	\fBbinary format\fR h3h*h2 AB def 987
167	.CE
168	will return a string equivalent to \fB\exba\ex00\exed\ex0f\ex89\fR.
169	.RE
170	.IP \fBc\fR 5
171	Stores one or more 8-bit integer values in the output string. If no
172	\fIcount\fR is specified, then \fIarg\fR must consist of an integer
173	value. If \fIcount\fR is specified, \fIarg\fR must consist of a list
174	containing at least that many integers. The low-order 8 bits of each integer
175	are stored as a one-byte value at the cursor position. If \fIcount\fR
176	is \fB*\fR, then all of the integers in the list are formatted. If the
177	number of elements in the list is greater
178	than \fIcount\fR, then the extra elements are ignored. For example,
179	.RS
180	.CS
181	\fBbinary format\fR c3cc* {3 -3 128 1} 260 {2 5}
182	.CE
183	will return a string equivalent to
184	\fB\ex03\exfd\ex80\ex04\ex02\ex05\fR, whereas
185	.CS
186	\fBbinary format\fR c {2 5}
187	.CE
188	will generate an error.
189	.RE
190	.IP \fBs\fR 5
191	This form is the same as \fBc\fR except that it stores one or more
192	16-bit integers in little-endian byte order in the output string. The
193	low-order 16-bits of each integer are stored as a two-byte value at
194	the cursor position with the least significant byte stored first. For
195	example,
196	.RS
197	.CS
198	\fBbinary format\fR s3 {3 -3 258 1}
199	.CE
200	will return a string equivalent to
201	\fB\ex03\ex00\exfd\exff\ex02\ex01\fR.
202	.RE
203	.IP \fBS\fR 5
204	This form is the same as \fBs\fR except that it stores one or more
205	16-bit integers in big-endian byte order in the output string. For
206	example,
207	.RS
208	.CS
209	\fBbinary format\fR S3 {3 -3 258 1}
210	.CE
211	will return a string equivalent to
212	\fB\ex00\ex03\exff\exfd\ex01\ex02\fR.
213	.RE
214	.IP \fBt\fR 5
215	.VS 8.5
216	This form (mnemonically \fItiny\fR) is the same as \fBs\fR and \fBS\fR
217	except that it stores the 16-bit integers in the output string in the
218	native byte order of the machine where the Tcl script is running.
219	To determine what the native byte order of the machine is, refer to
220	the \fBbyteOrder\fR element of the \fBtcl_platform\fR array.
221	.VE 8.5
222	.IP \fBi\fR 5
223	This form is the same as \fBc\fR except that it stores one or more
224	32-bit integers in little-endian byte order in the output string. The
225	low-order 32-bits of each integer are stored as a four-byte value at
226	the cursor position with the least significant byte stored first. For
227	example,
228	.RS
229	.CS
230	\fBbinary format\fR i3 {3 -3 65536 1}
231	.CE
232	will return a string equivalent to
233	\fB\ex03\ex00\ex00\ex00\exfd\exff\exff\exff\ex00\ex00\ex01\ex00\fR
234	.RE
235	.IP \fBI\fR 5
236	This form is the same as \fBi\fR except that it stores one or more one
237	or more 32-bit integers in big-endian byte order in the output string.
238	For example,
239	.RS
240	.CS
241	\fBbinary format\fR I3 {3 -3 65536 1}
242	.CE
243	will return a string equivalent to
244	\fB\ex00\ex00\ex00\ex03\exff\exff\exff\exfd\ex00\ex01\ex00\ex00\fR
245	.RE
246	.IP \fBn\fR 5
247	.VS 8.5
248	This form (mnemonically \fInumber\fR or \fInormal\fR) is the same as
249	\fBi\fR and \fBI\fR except that it stores the 32-bit integers in the
250	output string in the native byte order of the machine where the Tcl
251	script is running.
252	To determine what the native byte order of the machine is, refer to
253	the \fBbyteOrder\fR element of the \fBtcl_platform\fR array.
254	.VE 8.5
255	.IP \fBw\fR 5
256	This form is the same as \fBc\fR except that it stores one or more
257	64-bit integers in little-endian byte order in the output string. The
258	low-order 64-bits of each integer are stored as an eight-byte value at
259	the cursor position with the least significant byte stored first. For
260	example,
261	.RS
262	.CS
263	\fBbinary format\fR w 7810179016327718216
264	.CE
265	will return the string \fBHelloTcl\fR
266	.RE
267	.IP \fBW\fR 5
268	This form is the same as \fBw\fR except that it stores one or more one
269	or more 64-bit integers in big-endian byte order in the output string.
270	For example,
271	.RS
272	.CS
273	\fBbinary format\fR Wc 4785469626960341345 110
274	.CE
275	will return the string \fBBigEndian\fR
276	.RE
277	.IP \fBm\fR 5
278	.VS 8.5
279	This form (mnemonically the mirror of \fBw\fR) is the same as \fBw\fR
280	and \fBW\fR except that it stores the 64-bit integers in the output
281	string in the native byte order of the machine where the Tcl script is
282	running.
283	To determine what the native byte order of the machine is, refer to
284	the \fBbyteOrder\fR element of the \fBtcl_platform\fR array.
285	.VE 8.5
286	.IP \fBf\fR 5
287	This form is the same as \fBc\fR except that it stores one or more one
288	or more single-precision floating point numbers in the machine's native
289	representation in the output string. This representation is not
290	portable across architectures, so it should not be used to communicate
291	floating point numbers across the network. The size of a floating
292	point number may vary across architectures, so the number of bytes
293	that are generated may vary. If the value overflows the
294	machine's native representation, then the value of FLT_MAX
295	as defined by the system will be used instead. Because Tcl uses
296	double-precision floating point numbers internally, there may be some
297	loss of precision in the conversion to single-precision. For example,
298	on a Windows system running on an Intel Pentium processor,
299	.RS
300	.CS
301	\fBbinary format\fR f2 {1.6 3.4}
302	.CE
303	will return a string equivalent to
304	\fB\excd\excc\excc\ex3f\ex9a\ex99\ex59\ex40\fR.
305	.RE
306	.IP \fBr\fR 5
307	.VS 8.5
308	This form (mnemonically \fIreal\fR) is the same as \fBf\fR except that
309	it stores the single-precision floating point numbers in little-endian
310	order. This conversion only produces meaningful output when used on
311	machines which use the IEEE floating point representation (very
312	common, but not universal.)
313	.VE 8.5
314	.IP \fBR\fR 5
315	.VS 8.5
316	This form is the same as \fBr\fR except that it stores the
317	single-precision floating point numbers in big-endian order.
318	.VE 8.5
319	.IP \fBd\fR 5
320	This form is the same as \fBf\fR except that it stores one or more one
321	or more double-precision floating point numbers in the machine's native
322	representation in the output string. For example, on a
323	Windows system running on an Intel Pentium processor,
324	.RS
325	.CS
326	\fBbinary format\fR d1 {1.6}
327	.CE
328	will return a string equivalent to
329	\fB\ex9a\ex99\ex99\ex99\ex99\ex99\exf9\ex3f\fR.
330	.RE
331	.IP \fBq\fR 5
332	.VS 8.5
333	This form (mnemonically the mirror of \fBd\fR) is the same as \fBd\fR
334	except that it stores the double-precision floating point numbers in
335	little-endian order. This conversion only produces meaningful output
336	when used on machines which use the IEEE floating point representation
337	(very common, but not universal.)
338	.VE 8.5
339	.IP \fBQ\fR 5
340	.VS 8.5
341	This form is the same as \fBq\fR except that it stores the
342	double-precision floating point numbers in big-endian order.
343	.VE 8.5
344	.IP \fBx\fR 5
345	Stores \fIcount\fR null bytes in the output string. If \fIcount\fR is
346	not specified, stores one null byte. If \fIcount\fR is \fB*\fR,
347	generates an error. This type does not consume an argument. For
348	example,
349	.RS
350	.CS
351	\fBbinary format\fR a3xa3x2a3 abc def ghi
352	.CE
353	will return a string equivalent to \fBabc\e000def\e000\e000ghi\fR.
354	.RE
355	.IP \fBX\fR 5
356	Moves the cursor back \fIcount\fR bytes in the output string. If
357	\fIcount\fR is \fB*\fR or is larger than the current cursor position,
358	then the cursor is positioned at location 0 so that the next byte
359	stored will be the first byte in the result string. If \fIcount\fR is
360	omitted then the cursor is moved back one byte. This type does not
361	consume an argument. For example,
362	.RS
363	.CS
364	\fBbinary format\fR a3X*a3X2a3 abc def ghi
365	.CE
366	will return \fBdghi\fR.
367	.RE
368	.IP \fB@\fR 5
369	Moves the cursor to the absolute location in the output string
370	specified by \fIcount\fR. Position 0 refers to the first byte in the
371	output string. If \fIcount\fR refers to a position beyond the last
372	byte stored so far, then null bytes will be placed in the uninitialized
373	locations and the cursor will be placed at the specified location. If
374	\fIcount\fR is \fB*\fR, then the cursor is moved to the current end of
375	the output string. If \fIcount\fR is omitted, then an error will be
376	generated. This type does not consume an argument. For example,
377	.RS
378	.CS
379	\fBbinary format\fR a5@2a1@*a3@10a1 abcde f ghi j
380	.CE
381	will return \fBabfdeghi\e000\e000j\fR.
382	.RE
383	.SH "BINARY SCAN"
384	.PP
385	The \fBbinary scan\fR command parses fields from a binary string,
386	returning the number of conversions performed. \fIString\fR gives the
387	input bytes to be parsed (one byte per character, and characters not
388	representable as a byte have their high bits chopped)
389	and \fIformatString\fR indicates how to parse it.
390	Each \fIvarName\fR gives the name of a variable; when a field is
391	scanned from \fIstring\fR the result is assigned to the corresponding
392	variable.
393	.PP
394	As with \fBbinary format\fR, the \fIformatString\fR consists of a
395	sequence of zero or more field specifiers separated by zero or more
396	spaces. Each field specifier is a single type character followed by
397	an optional flag character followed by an optional numeric \fIcount\fR.
398	Most field specifiers consume one
399	argument to obtain the variable into which the scanned values should
400	be placed. The type character specifies how the binary data is to be
401	interpreted. The \fIcount\fR typically indicates how many items of
402	the specified type are taken from the data. If present, the
403	\fIcount\fR is a non-negative decimal integer or \fB*\fR, which
404	normally indicates that all of the remaining items in the data are to
405	be used. If there are not enough bytes left after the current cursor
406	position to satisfy the current field specifier, then the
407	corresponding variable is left untouched and \fBbinary scan\fR returns
408	immediately with the number of variables that were set. If there are
409	not enough arguments for all of the fields in the format string that
410	consume arguments, then an error is generated. The flag character
411	.QW u
412	may be given to cause some types to be read as unsigned values. The flag
413	is accepted for all field types but is ignored for non-integer fields.
414	.PP
415	A similar example as with \fBbinary format\fR should explain the
416	relation between field specifiers and arguments in case of the binary
417	scan subcommand:
418	.CS
419	\fBbinary scan\fR $bytes s3s first second
420	.CE
421	.PP
422	This command (provided the binary string in the variable \fIbytes\fR
423	is long enough) assigns a list of three integers to the variable
424	\fIfirst\fR and assigns a single value to the variable \fIsecond\fR.
425	If \fIbytes\fR contains fewer than 8 bytes (i.e. four 2-byte
426	integers), no assignment to \fIsecond\fR will be made, and if
427	\fIbytes\fR contains fewer than 6 bytes (i.e. three 2-byte integers),
428	no assignment to \fIfirst\fR will be made. Hence:
429	.CS
430	puts [\fBbinary scan\fR abcdefg s3s first second]
431	puts $first
432	puts $second
433	.CE
434	will print (assuming neither variable is set previously):
435	.CS
436	1
437	25185 25699 26213
438	can't read "second": no such variable
439	.CE
440	.PP
441	It is \fIimportant\fR to note that the \fBc\fR, \fBs\fR, and \fBS\fR
442	(and \fBi\fR and \fBI\fR on 64bit systems) will be scanned into
443	long data size values. In doing this, values that have their high
444	bit set (0x80 for chars, 0x8000 for shorts, 0x80000000 for ints),
445	will be sign extended. Thus the following will occur:
446	.CS
447	set signShort [\fBbinary format\fR s1 0x8000]
448	\fBbinary scan\fR $signShort s1 val; \fI# val == 0xFFFF8000\fR
449	.CE
450	If you require unsigned values you can include the
451	.QW u
452	flag character following
453	the field type. For example, to read an unsigned short value:
454	.CS
455	set signShort [\fBbinary format\fR s1 0x8000]
456	\fBbinary scan\fR $signShort su1 val; \fI# val == 0x00008000\fR
457	.CE
458	.PP
459	Each type-count pair moves an imaginary cursor through the binary data,
460	reading bytes from the current position. The cursor is initially
461	at position 0 at the beginning of the data. The type may be any one of
462	the following characters:
463	.IP \fBa\fR 5
464	The data is a byte string of length \fIcount\fR. If \fIcount\fR
465	is \fB*\fR, then all of the remaining bytes in \fIstring\fR will be
466	scanned into the variable. If \fIcount\fR is omitted, then one
467	byte will be scanned.
468	All bytes scanned will be interpreted as being characters in the
469	range \eu0000-\eu00ff so the \fBencoding convertfrom\fR command will be
470	needed if the string is not a binary string or a string encoded in ISO
471	8859\-1.
472	For example,
473	.RS
474	.CS
475	\fBbinary scan\fR abcde\e000fghi a6a10 var1 var2
476	.CE
477	will return \fB1\fR with the string equivalent to \fBabcde\e000\fR
478	stored in \fIvar1\fR and \fIvar2\fR left unmodified, and
479	.CS
480	\fBbinary scan\fR \e342\e202\e254 a* var1
481	set var2 [encoding convertfrom utf-8 $var1]
482	.CE
483	will store a Euro-currency character in \fIvar2\fR.
484	.RE
485	.IP \fBA\fR 5
486	This form is the same as \fBa\fR, except trailing blanks and nulls are stripped from
487	the scanned value before it is stored in the variable. For example,
488	.RS
489	.CS
490	\fBbinary scan\fR "abc efghi \e000" A* var1
491	.CE
492	will return \fB1\fR with \fBabc efghi\fR stored in \fIvar1\fR.
493	.RE
494	.IP \fBb\fR 5
495	The data is turned into a string of \fIcount\fR binary digits in
496	low-to-high order represented as a sequence of
497	.QW 1
498	and
499	.QW 0
500	characters. The data bytes are scanned in first to last order with
501	the bits being taken in low-to-high order within each byte. Any extra
502	bits in the last byte are ignored. If \fIcount\fR is \fB*\fR, then
503	all of the remaining bits in \fIstring\fR will be scanned. If
504	\fIcount\fR is omitted, then one bit will be scanned. For example,
505	.RS
506	.CS
507	\fBbinary scan\fR \ex07\ex87\ex05 b5b* var1 var2
508	.CE
509	will return \fB2\fR with \fB11100\fR stored in \fIvar1\fR and
510	\fB1110000110100000\fR stored in \fIvar2\fR.
511	.RE
512	.IP \fBB\fR 5
513	This form is the same as \fBb\fR, except the bits are taken in
514	high-to-low order within each byte. For example,
515	.RS
516	.CS
517	\fBbinary scan\fR \ex70\ex87\ex05 B5B* var1 var2
518	.CE
519	will return \fB2\fR with \fB01110\fR stored in \fIvar1\fR and
520	\fB1000011100000101\fR stored in \fIvar2\fR.
521	.RE
522	.IP \fBH\fR 5
523	The data is turned into a string of \fIcount\fR hexadecimal digits in
524	high-to-low order represented as a sequence of characters in the set
525	.QW 0123456789abcdef .
526	The data bytes are scanned in first to last
527	order with the hex digits being taken in high-to-low order within each
528	byte. Any extra bits in the last byte are ignored. If \fIcount\fR is
529	\fB*\fR, then all of the remaining hex digits in \fIstring\fR will be
530	scanned. If \fIcount\fR is omitted, then one hex digit will be
531	scanned. For example,
532	.RS
533	.CS
534	\fBbinary scan\fR \ex07\exC6\ex05\ex1f\ex34 H3H* var1 var2
535	.CE
536	will return \fB2\fR with \fB07c\fR stored in \fIvar1\fR and
537	\fB051f34\fR stored in \fIvar2\fR.
538	.RE
539	.IP \fBh\fR 5
540	This form is the same as \fBH\fR, except the digits are taken in
541	reverse (low-to-high) order within each byte. For example,
542	.RS
543	.CS
544	\fBbinary scan\fR \ex07\ex86\ex05\ex12\ex34 h3h* var1 var2
545	.CE
546	will return \fB2\fR with \fB706\fR stored in \fIvar1\fR and
547	\fB502143\fR stored in \fIvar2\fR.
548	.RE
549	Note that most code that wishes to parse the hexadecimal digits from
550	multiple bytes in order should use the \fBH\fR format.
551	.IP \fBc\fR 5
552	The data is turned into \fIcount\fR 8-bit signed integers and stored
553	in the corresponding variable as a list. If \fIcount\fR is \fB*\fR,
554	then all of the remaining bytes in \fIstring\fR will be scanned. If
555	\fIcount\fR is omitted, then one 8-bit integer will be scanned. For
556	example,
557	.RS
558	.CS
559	\fBbinary scan\fR \ex07\ex86\ex05 c2c* var1 var2
560	.CE
561	will return \fB2\fR with \fB7 -122\fR stored in \fIvar1\fR and \fB5\fR
562	stored in \fIvar2\fR. Note that the integers returned are signed, but
563	they can be converted to unsigned 8-bit quantities using an expression
564	like:
565	.CS
566	set num [expr { $num & 0xff }]
567	.CE
568	.RE
569	.IP \fBs\fR 5
570	The data is interpreted as \fIcount\fR 16-bit signed integers
571	represented in little-endian byte order. The integers are stored in
572	the corresponding variable as a list. If \fIcount\fR is \fB*\fR, then
573	all of the remaining bytes in \fIstring\fR will be scanned. If
574	\fIcount\fR is omitted, then one 16-bit integer will be scanned. For
575	example,
576	.RS
577	.CS
578	\fBbinary scan\fR \ex05\ex00\ex07\ex00\exf0\exff s2s* var1 var2
579	.CE
580	will return \fB2\fR with \fB5 7\fR stored in \fIvar1\fR and \fB\-16\fR
581	stored in \fIvar2\fR. Note that the integers returned are signed, but
582	they can be converted to unsigned 16-bit quantities using an expression
583	like:
584	.CS
585	set num [expr { $num & 0xffff }]
586	.CE
587	.RE
588	.IP \fBS\fR 5
589	This form is the same as \fBs\fR except that the data is interpreted
590	as \fIcount\fR 16-bit signed integers represented in big-endian byte
591	order. For example,
592	.RS
593	.CS
594	\fBbinary scan\fR \ex00\ex05\ex00\ex07\exff\exf0 S2S* var1 var2
595	.CE
596	will return \fB2\fR with \fB5 7\fR stored in \fIvar1\fR and \fB\-16\fR
597	stored in \fIvar2\fR.
598	.RE
599	.IP \fBt\fR 5
600	.VS 8.5
601	The data is interpreted as \fIcount\fR 16-bit signed integers
602	represented in the native byte order of the machine running the Tcl
603	script. It is otherwise identical to \fBs\fR and \fBS\fR.
604	To determine what the native byte order of the machine is, refer to
605	the \fBbyteOrder\fR element of the \fBtcl_platform\fR array.
606	.VE 8.5
607	.IP \fBi\fR 5
608	The data is interpreted as \fIcount\fR 32-bit signed integers
609	represented in little-endian byte order. The integers are stored in
610	the corresponding variable as a list. If \fIcount\fR is \fB*\fR, then
611	all of the remaining bytes in \fIstring\fR will be scanned. If
612	\fIcount\fR is omitted, then one 32-bit integer will be scanned. For
613	example,
614	.RS
615	.CS
616	set str \ex05\ex00\ex00\ex00\ex07\ex00\ex00\ex00\exf0\exff\exff\exff
617	\fBbinary scan\fR $str i2i* var1 var2
618	.CE
619	will return \fB2\fR with \fB5 7\fR stored in \fIvar1\fR and \fB\-16\fR
620	stored in \fIvar2\fR. Note that the integers returned are signed, but
621	they can be converted to unsigned 32-bit quantities using an expression
622	like:
623	.CS
624	set num [expr { $num & 0xffffffff }]
625	.CE
626	.RE
627	.IP \fBI\fR 5
628	This form is the same as \fBI\fR except that the data is interpreted
629	as \fIcount\fR 32-bit signed integers represented in big-endian byte
630	order. For example,
631	.RS
632	.CS
633	set str \ex00\ex00\ex00\ex05\ex00\ex00\ex00\ex07\exff\exff\exff\exf0
634	\fBbinary scan\fR $str I2I* var1 var2
635	.CE
636	will return \fB2\fR with \fB5 7\fR stored in \fIvar1\fR and \fB\-16\fR
637	stored in \fIvar2\fR.
638	.RE
639	.IP \fBn\fR 5
640	.VS 8.5
641	The data is interpreted as \fIcount\fR 32-bit signed integers
642	represented in the native byte order of the machine running the Tcl
643	script. It is otherwise identical to \fBi\fR and \fBI\fR.
644	To determine what the native byte order of the machine is, refer to
645	the \fBbyteOrder\fR element of the \fBtcl_platform\fR array.
646	.VE 8.5
647	.IP \fBw\fR 5
648	The data is interpreted as \fIcount\fR 64-bit signed integers
649	represented in little-endian byte order. The integers are stored in
650	the corresponding variable as a list. If \fIcount\fR is \fB*\fR, then
651	all of the remaining bytes in \fIstring\fR will be scanned. If
652	\fIcount\fR is omitted, then one 64-bit integer will be scanned. For
653	example,
654	.RS
655	.CS
656	set str \ex05\ex00\ex00\ex00\ex07\ex00\ex00\ex00\exf0\exff\exff\exff
657	\fBbinary scan\fR $str wi* var1 var2
658	.CE
659	will return \fB2\fR with \fB30064771077\fR stored in \fIvar1\fR and
660	\fB\-16\fR stored in \fIvar2\fR. Note that the integers returned are
661	signed and cannot be represented by Tcl as unsigned values.
662	.RE
663	.IP \fBW\fR 5
664	This form is the same as \fBw\fR except that the data is interpreted
665	as \fIcount\fR 64-bit signed integers represented in big-endian byte
666	order. For example,
667	.RS
668	.CS
669	set str \ex00\ex00\ex00\ex05\ex00\ex00\ex00\ex07\exff\exff\exff\exf0
670	\fBbinary scan\fR $str WI* var1 var2
671	.CE
672	will return \fB2\fR with \fB21474836487\fR stored in \fIvar1\fR and \fB\-16\fR
673	stored in \fIvar2\fR.
674	.RE
675	.IP \fBm\fR 5
676	.VS 8.5
677	The data is interpreted as \fIcount\fR 64-bit signed integers
678	represented in the native byte order of the machine running the Tcl
679	script. It is otherwise identical to \fBw\fR and \fBW\fR.
680	To determine what the native byte order of the machine is, refer to
681	the \fBbyteOrder\fR element of the \fBtcl_platform\fR array.
682	.VE 8.5
683	.IP \fBf\fR 5
684	The data is interpreted as \fIcount\fR single-precision floating point
685	numbers in the machine's native representation. The floating point
686	numbers are stored in the corresponding variable as a list. If
687	\fIcount\fR is \fB*\fR, then all of the remaining bytes in
688	\fIstring\fR will be scanned. If \fIcount\fR is omitted, then one
689	single-precision floating point number will be scanned. The size of a
690	floating point number may vary across architectures, so the number of
691	bytes that are scanned may vary. If the data does not represent a
692	valid floating point number, the resulting value is undefined and
693	compiler dependent. For example, on a Windows system running on an
694	Intel Pentium processor,
695	.RS
696	.CS
697	\fBbinary scan\fR \ex3f\excc\excc\excd f var1
698	.CE
699	will return \fB1\fR with \fB1.6000000238418579\fR stored in
700	\fIvar1\fR.
701	.RE
702	.IP \fBr\fR 5
703	.VS 8.5
704	This form is the same as \fBf\fR except that the data is interpreted
705	as \fIcount\fR single-precision floating point number in little-endian
706	order. This conversion is not portable to the minority of systems not
707	using IEEE floating point representations.
708	.VE 8.5
709	.IP \fBR\fR 5
710	.VS 8.5
711	This form is the same as \fBf\fR except that the data is interpreted
712	as \fIcount\fR single-precision floating point number in big-endian
713	order. This conversion is not portable to the minority of systems not
714	using IEEE floating point representations.
715	.VE 8.5
716	.IP \fBd\fR 5
717	This form is the same as \fBf\fR except that the data is interpreted
718	as \fIcount\fR double-precision floating point numbers in the
719	machine's native representation. For example, on a Windows system
720	running on an Intel Pentium processor,
721	.RS
722	.CS
723	\fBbinary scan\fR \ex9a\ex99\ex99\ex99\ex99\ex99\exf9\ex3f d var1
724	.CE
725	will return \fB1\fR with \fB1.6000000000000001\fR
726	stored in \fIvar1\fR.
727	.RE
728	.IP \fBq\fR 5
729	.VS 8.5
730	This form is the same as \fBd\fR except that the data is interpreted
731	as \fIcount\fR double-precision floating point number in little-endian
732	order. This conversion is not portable to the minority of systems not
733	using IEEE floating point representations.
734	.VE 8.5
735	.IP \fBQ\fR 5
736	.VS 8.5
737	This form is the same as \fBd\fR except that the data is interpreted
738	as \fIcount\fR double-precision floating point number in big-endian
739	order. This conversion is not portable to the minority of systems not
740	using IEEE floating point representations.
741	.VE 8.5
742	.IP \fBx\fR 5
743	Moves the cursor forward \fIcount\fR bytes in \fIstring\fR. If
744	\fIcount\fR is \fB*\fR or is larger than the number of bytes after the
745	current cursor position, then the cursor is positioned after
746	the last byte in \fIstring\fR. If \fIcount\fR is omitted, then the
747	cursor is moved forward one byte. Note that this type does not
748	consume an argument. For example,
749	.RS
750	.CS
751	\fBbinary scan\fR \ex01\ex02\ex03\ex04 x2H* var1
752	.CE
753	will return \fB1\fR with \fB0304\fR stored in \fIvar1\fR.
754	.RE
755	.IP \fBX\fR 5
756	Moves the cursor back \fIcount\fR bytes in \fIstring\fR. If
757	\fIcount\fR is \fB*\fR or is larger than the current cursor position,
758	then the cursor is positioned at location 0 so that the next byte
759	scanned will be the first byte in \fIstring\fR. If \fIcount\fR
760	is omitted then the cursor is moved back one byte. Note that this
761	type does not consume an argument. For example,
762	.RS
763	.CS
764	\fBbinary scan\fR \ex01\ex02\ex03\ex04 c2XH* var1 var2
765	.CE
766	will return \fB2\fR with \fB1 2\fR stored in \fIvar1\fR and \fB020304\fR
767	stored in \fIvar2\fR.
768	.RE
769	.IP \fB@\fR 5
770	Moves the cursor to the absolute location in the data string specified
771	by \fIcount\fR. Note that position 0 refers to the first byte in
772	\fIstring\fR. If \fIcount\fR refers to a position beyond the end of
773	\fIstring\fR, then the cursor is positioned after the last byte. If
774	\fIcount\fR is omitted, then an error will be generated. For example,
775	.RS
776	.CS
777	\fBbinary scan\fR \ex01\ex02\ex03\ex04 c2@1H* var1 var2
778	.CE
779	will return \fB2\fR with \fB1 2\fR stored in \fIvar1\fR and \fB020304\fR
780	stored in \fIvar2\fR.
781	.RE
782	.SH "PORTABILITY ISSUES"
783	The \fBr\fR, \fBR\fR, \fBq\fR and \fBQ\fR conversions will only work
784	reliably for transferring data between computers which are all using
785	IEEE floating point representations. This is very common, but not
786	universal. To transfer floating-point numbers portably between all
787	architectures, use their textual representation (as produced by
788	\fBformat\fR) instead.
789	.SH EXAMPLES
790	This is a procedure to write a Tcl string to a binary-encoded channel as
791	UTF-8 data preceded by a length word:
792	.CS
793	proc \fIwriteString\fR {channel string} {
794	set data [encoding convertto utf-8 $string]
795	puts -nonewline [\fBbinary format\fR Ia* \e
796	[string length $data] $data]
797	}
798	.CE
799	.PP
800	This procedure reads a string from a channel that was written by the
801	previously presented \fIwriteString\fR procedure:
802	.CS
803	proc \fIreadString\fR {channel} {
804	if {![\fBbinary scan\fR [read $channel 4] I length]} {
805	error "missing length"
806	}
807	set data [read $channel $length]
808	return [encoding convertfrom utf-8 $data]
809	}
810	.CE
811	.SH "SEE ALSO"
812	format(n), scan(n), tclvars(n)
813	.SH KEYWORDS
814	binary, format, scan

Note: See TracBrowser for help on using the repository browser.

Download in other formats:

Original Format