![]() |
Typical errors of porting C++ code on the 64-bit platform
Annotation. Program errors occurring while porting C++ code from 32-bit platforms on 64-bit ones are observed. Examples of the incorrect code and the ways to correct it are given. Methods and means of the code analysis which allow to diagnose the errors discussed, are listed.
This text is an abridged variant of "20 issues of porting C++ code on the 64-bit platform" article. If you are interested in the topic of porting programs to 64-bit systems, you can find a full version of the article here 20 issues of porting C++ code on the 64-bit platform. We offer you to read the article devoted to the port of the program code of 32-bit applications on 64-bit systems. The article is written for programmers who use C++ but it may be also useful for all who face the port of applications on other platforms. One should understand properly that the new class of errors which appears while writing 64-bit programs is not just some new incorrect constructions among thousands of others. These are inevitable difficulties which the developers of any developing program will face. This article will help you to be ready for these difficulties and will show the ways to overcome them. Besides advantages, any new technology (in programming and other spheres as well) carries some limitations and even problems of using this technology. The same situation can be found in the sphere of developing 64-bit software. We all know that 64-bit software is the next step of the information technologies development. But in reality, only few programmers have faced the nuances of this sphere and developing 64-bit programs in particular. We won?t dwell on the advantages which the use of 64-bit architecture opens before programmers. There are a lot of publications devoted to this theme and the reader can find them easily. The aim of this article is to observe thoroughly those problems which the developer of 64-bit programs can face. In the article you will learn about: The aim of this article is to observe thoroughly those problems which the developer of 64-bit programs can face. In the article you will learn about:
The given information will allow you:
There are a lot of examples given in the article which you should try in the programming environment for better understanding. Going into them you will get something more than just an addition of separate elements. You will open the door into the world of 64-bit systems. We'll use term "memsize" type in the text. By this we'll understand any simple integer type which is capable to keep a pointer and changes its size according to the change of the platform dimension form 32-bit to 64-bit. The examples memsize types: size_t, ptrdiff_t, all pointers, intptr_t, INT_PTR, DWORD_PTR. We should say some words about the data models which determine the correspondence of the sizes of fundamental types for different systems. Table N1 contains data models which can interest us. Code:
ILP32 LP64 LLP64 ILP64And finally, 64-bit model in Linux (LP64) differs from that in Windows (LLP64) only in the size of long type. So long as it is their only difference, to summarize the account we'll avoid using long, unsigned long types, and will use ptrdiff_t, size_t types. Let's start observing the type errors which occur while porting programs on the 64-bit architecture. 1. Off-warnings.In all books devoted to the development of the quality code it is recommended to set a warning level of warnings shown by the compiler on as high level as possible. But there are situations in practice when for some project parts there is a lower diagnosis level set or it is even set off. Usually it is very old code which is supported but not modified. Programmers who work over the project are used to that this code works and don't take its quality into consideration. Here it is a danger to miss serious warnings by the compiler while porting programs on the new 64-bit system. While porting an application you should obligatory set on warnings for the whole project which help to check the compatibility of the code and analyze them thoroughly. It can help to save a lot of time while debugging the project on the new architecture. If we won't do this the simplest and stupidest errors will occur in all their variety. Here it is a simple example of overflow which occurs in a 64-bit program if we ignore warnings at all. Code:
unsigned char *array[50];2. Use of the functions with a variable number of arguments.The typical example is the incorrect use of printf, scanf functions and their variants: Code:
1) const char *invalidFormat="%u";Code:
2) char buf[9];In the second case the author of the code didn't take into account that the pointer size may become more than 32-bit in future. As a result this code will cause the buffer overflow on the 64-bit architecture. The incorrect use of functions with a variable number of arguments is a typical error on all the architectures, not only on 64-bit ones. This is related to the fundamental danger of the use of the given C++ language constructions. The common practice is to refuse them and use safe programming methods. We recommend you strongly to modify the code and use safe methods. For example, you may replace printf with cout, and sprintf with boost::format or std::stringstream. If you have to support the code which uses functions of sscanf type, in the control lines format we can use special macros which open into necessary modifiers for different systems. An example: Code:
// PR_SIZET on Win64=I"3. Magic numbersIn the low-quality code there are often magic numbers the mere presence of which is dangerous. During the migration of the code on the 64-bit platform these magic numbers may make the code inefficient if they participate in operations of calculation of address, objects size or bit operations. Table N2 contains basic magic numbers which may influence the workability of an application on a new platform. Code:
Value DescriptionLet's look at some errors related to the use of magic numbers. The most frequent is a record of number values of type sizes. Code:
1) size_t ArraySize=N * 4;Code:
2) size_t values[ARRAY_SIZE];Code:
3) size_t n, newexp;Code:
1) size_t ArraySize=N * sizeof(intptr_t);Code:
2) size_t values[ARRAY_SIZE];Code:
3) size_t n, newexp;Code:
// constant '1111..110000'Code:
#ifdef _WIN64Code:
#define INVALID_RESULT (0xFFFFFFFFu)Code:
int a=-1; // 0xFFFFFFFFi32Let's return to the error with INVALID_RESULT. The use of the number 0xFFFFFFFFu causes the failure of the execution of "len == (size_t)(-1)" condition in a 64-bit program. The best solution is to change the code in such a way that it doesn't need special marker values. If you cannot refuse them for some reason or think it unreasonable to correct the code fundamentally just use fair value -1. Code:
#define INVALID_RESULT (size_t(-1))4. Bit shifting operations.Bit shifting operations can cause a lot of troubles during the port from the 32-bit system on the 64-bit one if used inattentively. Let's begin with an example of a function which defines the bit you've chosen as 1 in a variable of memsize type. Code:
ptrdiff_t SetBitN(ptrdiff_t value, unsigned bitNum) {Pay attention that "1" has int type and during the shift on 32 positions an overflow will occur. To correct the code it is necessary to make the constant "1" of the same type as the variable mask. Code:
ptrdiff_t mask=ptrdiff_t(1) << bitNum;5. Storing of pointer addresses.A large number of errors during the migration on 64-bit systems are related to the change of a pointer size in relation to the size of usual integers. In the environment with the data ILP32 usual integers and pointers have the same size. Unfortunately the 32-bit code is based on this supposition everywhere. Pointers are often casted to int, unsigned int and other types improper to fulfill address calculations. You should understand exactly that one should use only memsize types for integer pointers form. Preference should be given to uintptr_t type for it shows intentions more clearly and makes the code more portable saving it from changes in future Let's look at two small examples. Code:
1) char *p;Code:
2) DWORD tmp=(DWORD)malloc(ArraySize); Code:
1) char *p;Code:
2) DWORD_PTR tmp=(DWORD_PTR)malloc(ArraySize); The following code won't hide and will show up at the first execution. Code:
void GetBufferAddr(void **retPtr) {Code:
uintptr_t bufAddress;In the end I'd like to mention that it will be a bad style to store the pointer address into types which are always equal 64-bits. One will have to correct the code shown further again when 128-bit systems will appear. Code:
PVOID p;6. Memsize types in unions.The peculiarity of a union is that for all members of the union the same memory area is allocated that is they overlap. Although the access to this memory area is possible with the use of any of the elements the element for this aim should be chosen so that the result won't be senseless. One should pay attention to the unions which contain pointers and other members of memsize type. When there is a necessity to work with a pointer as an integer sometimes it is convenient to use the union as it is shown in the example, and work with the number form of the type without using explicit conversions. Code:
union PtrNumUnion {Code:
union PtrNumUnion {Code:
union SizetToBytesUnion {Code:
union SizetToBytesUnion {7. Change of an array typeSometimes it is necessary (or just convenient) in programs to present array items in the form of the elements of a different type. Dangerous and safe type conversions are shown in the following code. Code:
int array[4]={ 1, 2, 3, 4 };On the 64-bit system we got "2 17179869187" in the output for it is value 17179869187 which is situated in the first item of sizetPtr array. In some cases we need this very behavior but usually it is an error. The correction of the described situation consists in the refuse of dangerous type conversions by modernizing the program. Another variant is to create a new array and copy values of the original one into it. 8. Virtual functions with arguments of memsize typeIf there are big derived class graphs with virtual functions in your program, there is a risk to use inattentively arguments of different types but these types actually coincide on the 32-bit system. For example, in the base class you use size_t type as an argument of a virtual function and in the derived class type unsigned. So this code will be incorrect on the 64-bit system. But an error like this doesn't necessarily hide in big derived class graphs and here it is one of the examples. Code:
class CWinApp {Code:
virtual void WinHelp(DWORD dwData, UINT nCmd=HELP_CONTEXT);The correction consists in the use of the same types in the corresponding virtual functions. Code:
class CSampleApp : public CWinApp {9. Serialization and data exchange.An important point during the port of a software solution on the new platform is succession to the existing data exchange protocol. It is necessary to provide the read of the existing projects formats, to carry out the data exchange between 32-bit and 64-bit processes etc. Mostly the errors of this kind consist in the serialization of memsize types and data exchange operations using them. Code:
1) size_t PixelCount;Code:
2) __int32 value_1;Code:
3) time_t time;The use of types of volatile size. It is unacceptably to use types which change their size depending on the development environment in binary interfaces of data exchange. In C++ language all the types don't have distinct sizes and consequently it is not possible to use them all for these aims. That's why the developers of the development means and programmers themselves develop data types which have an exact size such as __int8, __int16, INT32, word64 etc. The use of such types provides data portability between programs on different platforms although it needs the use of odd ones. The three shown examples are written inaccurately and this will show up on the changing of the capacity of some data types from 32-bit to 64-bit. Taking into account the necessity to support old data formats the correction may look as follows. Code:
1) size_t PixelCount;Code:
2) __int32 value_1;Code:
3) time_t time;Ignoring of the byte order. Even after the correction of volatile type sizes you may face the incompatibility of binary formats. The reason is a different data presentation. Most frequently it is related to a different byte order. The byte order is a method of recording of bytes of multibyte numbers. The little-endian order means that the recording begins with the lowest byte and ends with the highest one. This record order was acceptable in the memory of PCs with x86-processors. The big-endian order - the recording begins with the highest byte and ends with the lowest one. This order is a standard for TCP/IP protocols. That's why the big-endian byte order is often called the network byte order. This byte order is used by processors Motorola 68000, SPARC. While developing the binary interface or data format you should remember about the byte order. If the 64-bit system on which you are porting a 32-bit application has a different byte order you'll just have to take it into account in your code. For conversion between the big-endian byte order and the little-endian one you may use functions htonl(), htons(), bswap_64 etc. 10. Pointer address arithmetic.The first example. Code:
unsigned short a16, b16, c16;One should take care to avoid possible overflows in pointer arithmetic. For this purpose it's better to use memsize types or the explicit type conversion in expressions which carry pointers. Using the explicit type conversion we can rewrite the code in the following way. Code:
short a16, b16, c16;Code:
int A=-2;
Then calculation of "ptr + 0xFFFFFFFFu" takes place but the result of it depends on the pointer size on the particular architecture. If addition will take place in a 32-bit program the given expression will be an equivalent of "ptr-1" and we'll successfully print number 3. In a 64-bit program 0xFFFFFFFFu value will be added fairly to the pointer and the result will be that the pointer will be outbound of the array. And while getting access to the item of this pointer we'll face troubles. To avoid the shown situation, as well as in the first case, we advise you to use only memsize types in pointer arithmetic. Here are two variants of the code correction: Code:
ptr=ptr + (ptrdiff_t(A) + ptrdiff_t(B));Code:
ptrdiff_t A=-2;Code:
int A=-2;
11. Arrays indexing.This kind of errors is separated from the others for better structuring of the account because indexing in arrays with the use of square brackets is just a different record of address arithmetic observed before. In programming in language C and then C++ a practice formed to use in the constructions of the following kind variables of int/unsigned types: Code:
unsigned Index=0;The given code won't process in a 64-bit program an array containing more than UINT_MAX items. After the access to the item with UNIT_MAX index an overflow of the variable Index will occur and we'll get infinite loop. To persuade you entirely in the necessity of using only memsize types for indexing and in the expressions of address arithmetic, I'll give the last example. Code:
class Region {Programmers often make a mistake trying to correct the code in the following way: Code:
float Region::GetCell(int x, int y, int z) const {If you want to correct the code without changing types of the variables participating in the expression you may use the explicit type conversion of every variable memsize type: Code:
float Region::GetCell(int x, int y, int z) const {Code:
typedef ptrdiff_t TCoord;12. Mixed use of simple integer types and memsize types.Mixed use of memsize and non-memsize types in expressions may cause incorrect results on 64-bit systems and be related to the change of the input values rate. Let's study some examples. Code:
size_t Count=BigValue;Another frequent error is a record of the expressions of the following kind: Code:
int x, y, z;Let's give an example of a small code which shows the danger of inaccurate expressions with mixed types (the results are got with the use Microsoft Visual C++ 2005, 64-bit compilation mode). Code:
int x=100000;Code:
intptr_t v2=intptr_t(x) * y * z;The order of the calculation of an expression with operators of the same priority is not defined. To be more exact, the compiler can calculate sub-expressions in such an order which it considers to be more efficient even if sub-expressions cause (side effect). The order of the appearing of side effects is not defined. Expressions including communicative and association operations (*, +, &, |, ^), may be converted in a free way even if there are brackets. To assign the strict order of the calculation of the expression it is necessary to use the explicit temporary variable. That's why if the result of the expression should be of memsize type, only memsize types must participate in the expression. The right variant: Code:
//OK:Mixed use of types may occur in the change of the program logic. Code:
ptrdiff_t val_1=-1;If you need to return the previous behavior you should change the variable val_2 type. Code:
ptrdiff_t val_1=-1;13. Implicit type conversions while using functions.Observing the previous kind of errors related to mixing of simple integer types and memsize types, we surveyed only simple expressions. But similar problems may occur while using other C++ constructions too. Code:
extern int Width, Height, Depth;Code:
extern int Width, Height, Depth;Code:
extern char *begin, *end;And here it is one more example but now we'll observe not the returned value but the formal function argument. Code:
void foo(ptrdiff_t delta);14. Overload functions.During the port of 32-bit programs on the 64-bit platform the change of the logic of its work may be found which is related to the use of overload functions. If the function is overlapped for 32-bit and 64-bit values the access to it with the argument of memsize type will be compiled into different calls on different systems. This method may be useful as, for example, in the following code: Code:
static size_t GetBitCount(const unsigned __int32 &) {Code:
class MyStack {We think you understand this kind of errors and that you should pay attention to the call of overload functions transferring actual arguments of memsize type. 15. Data alignment.Processors work more efficiently when they deal with data aligned properly. As a rule the 32-bit data item must be aligned at the border multiple 4 bytes and the 64-bit item at the border 8 bytes. The try to work with unaligned data on processors IA-64 (Itanium) as it is shown in the following example, will cause exception. Code:
#pragma pack (1) // Also set by key /Zp in MSVCCode:
#pragma pack (1) // Also set by key /Zp in MSVCOn the architecture x64 during the access to unaligned data exception does not occur but you should avoid them either. Firstly, because of the essential slowing down of the speed of the access to these data, and secondly, because of a high probability of porting the program on the platform IA-64 in future. Let's look at one more example of the code which does not take into account the data alignment. Code:
struct MyPointersArray {The correct calculation of the size should look as follows: Code:
struct MyPointersArray {Always use these macros to get a shift in the structure without relying on your knowledge of the sizes of types and the alignment. Here it is the example of the code with the correct calculation of the structure member address: Code:
struct TFoo {16. The use of outdated functions and predefined constants.While developing a 64-bit application, be sure to bear in mind the changes of the environment in which it will be performed. Some functions will become outdated and it will be necessary to replace them with some variants. GetWindowLong is a good example of such function in the Windows operation system. Pay your attention to the constants referring to the interaction with the environment in which the program is functioning. In Windows the lines containing "system32" or "Program Files" will be suspect. 17. Explicit type conversions.Be accurate with explicit type conversions. They may change the logic of the program execution when types change their capacity of cause the loss of significant bits. It is difficult to adduce typical examples of errors related to the explicit type conversion for they are very different and specific for different programs. You have become acquainted with some errors related to the explicit type conversion earlier. Error diagnosis.The diagnosis of the errors occurring while porting 32-bit programs on 64-bit systems is a difficult task. The port of a not very quality code written without taking into account peculiarities of other architectures, may demand a lot of time and efforts. That's why we'll pay some attention to the description of methods and means which may simplify this task. Unit test.Unit test have fought well-earned respect among programmers long ago. Unit tests will help to check the correctness of the program after the port on a new platform. But there is one nuance which you should keep in mind. Unit test may not allow you to check the new ranges of input values which become accessible on 64-bit systems. Unit tests are originally developed in such a way that they can be passed in a short time. And the function which usually works with an array with the size of tens of Mb, will probably process tens of Kb in unit tests. It is justified for this function in tests may be called many times with different sets of input values. But suppose you have a 64-bit variant of the program. And now the function we study is processing more than 4 Gb of data. Surely there appears a necessity to raise the input size of an array in the tests either up to sizes more than 4 Gb. The problem is that the time of passing the tests will increase greatly in such a case. That's why while modifying the sets of tests keep in mind the compromise between speed of passing unit tests and the fullness of the checks. Fortunately, there are other methods which can help you to make sure of the efficiency of your applications. Code review.Code review is the best method of searching errors and improving code. Combined thorough code review may help to get rid of the errors in the program completely which are related to the peculiarities of the development of 64-bit applications. Of course, in the beginning one should learn which errors exactly one should search, otherwise the review won't give good results. For this purpose it is necessary to read this and other articles devoted to the port of programs from 32-bit systems on 64-bit ones, in time. Some interesting links concerning this topic can be found in the end of the article. But this approach to the analysis of the original code has on significant disadvantage. It demands a lot of time and because of this it is actually inapplicable on large projects. The compromise is the use of static analyzers. A static analyzer can be considered to be an automated system for code review where a fetch of potentially dangerous places is created for a programmer so that he could carry out the further analysis. But in any case it is desirable to provide several code reviews for combined teaching the team to search for new kinds of errors occurring on 64-bit systems. Built-in means of compilers.Compilers allow to solve some problems of searching defect code. They often have built-in different mechanisms for diagnosing errors observed. For example, in Microsoft Visual C++ 2005 the following keys may be useful: /Wp64, /Wall, and in SunStudio C++ key -xport64. Unfortunately, the possibilities they provide are often not enough and you should not rely only on them. But in any case it is highly recommended to enable the corresponding options of a compiler for diagnosing errors in the 64-bit code. Static analyzers.Static analyzers are a fine means to improve quality and safety of the program code. The basic difficulty related to the use of static analyzers consists in the fact that they generate quite a lot false warning messages about potential errors. Programmers being lazy by nature use this argument to find some way not to correct the found errors. In Microsoft this problem is solved by including necessarily the found errors in the bug tracking system. Thus a programmer just cannot choose between the correction of the code and tries to avoid this. We think that such strict rules are justified. The profit of the quality code covers the outlay of time for static analysis and corresponding code modification. This profit is achieved by means of simplifying the code support and reducing the time of debugging and testing. Static analyzers may be successfully used for diagnosing many of the kinds of errors observed in the article. The authors know 3 static analyzers which are supposed to have means of diagnosing errors related to the port of programs on 64-bit systems. We would like to warn you at once that we may be mistaken about the possibilities they have, moreover, these are developing products and new versions may have great efficiency.
Conclusion.If you read these lines we are glad that you're interested. We hope the article has been useful for you and will help to simplify the development and debugging of 64-bit applications. We will be glad to receive your opinions, remarks, corrections, additions and will surely include them in the next version of the article. The more we'll describe typical errors the more profitable will be the use of our experience and getting help. Resources.
|
Re: Typical errors of porting C++ code on the 64-bit platform
Surely this article is very useful. But it has become a bit obsolete
by now and has some inaccuracies. To get a more detailed description of 64-bit errors, please see the lessons on development of 64-bit applications. |
| All times are GMT +5.5. The time now is 14:06. |