This article contains functions and features that are not documented by the original manufacturer. By following advice in this article, you're doing so at your own risk. The methods presented in this article may rely on internal implementation and may not work in the future.

Intro

This blog post will give some explanations of the internal use of the __imp_ and __imp_load_ prefixes by Microsoft compilers.

Types of the CALL instruction

In the Intel architecture the compiler can generally use two types of the call CPU instruction:

CALL rel32 - specifies relative displacement against the next instruction.
CALL [m] - specifies absolute indirect address.

If we make a call to a function inside some module (or if the distance to the function is within -2Gb/+2Gb, or -0x80000000/+0x7FFFFFFF) then using the CALL rel32 instruction becomes more efficient, than the CALL [m] one. Here's why:

5 bytes versus 6.
CALL rel32 is base-independent and doesn't need any additional relocation. CALL [m] on the other hand, requires an absolute address in memory in [m]. This means that every such memory slot needs to be described in the relocation table (which adds +2 bytes), plus we also need to fix this address when the module is relocated.

That is why compiler tries to use CALL rel32 instruction to call functions inside the same module.

But when calling functions in another module, the situation changes.

First of all, it may not be possible to use CALL rel32 as we can't go further out than the -2Gb/+2Gb from that instruction. Even in the 32-bit code (with 4Gb memory space) this may not be enough. (Although in Windows it is enough since we can't cross the boundary between the user and kernel space.) But in the 64-bit system, some mapped modules often sit too far apart from each other in memory. For instance, one module can be mapped at 0x7FF729110000 and another one at 0x7FFFE0480000 with a relative offset of 0x8B7370000 between them, that exceeds -0x80000000/+0x7FFFFFFF.

Then, even if all modules weren't mapped further out than -2Gb/+2Gb from each other (which is possible for 32-bit Windows), there's also another problem - relocation.

When we make a call with CALL rel32 inside a module, the distance between 2 functions (rel32) is unchangeable no matter what base address the module is mapped at. That distance is known at the linking-time. But if this is a call between two modules that can be mapped at different base addresses, the distance between them is different every time they load, and we can know it only during run-time. That is why every CALL rel32 instruction for function calls in another module needs to be described with the use of relocation.

Of course, we will also need relocation for the CALL [m] instructions. But it is only needed for the memory slot it points to (or [m]) and not for the instruction itself. There are (usually) a smaller number of such memory slots needed than the instructions themselves.

As an example, let's take a call to the CloseHandle function. A program may contain multiple calls to that function from different places. In other words, there're many instructions like:

x86[Copy]

call [__imp_CloseHandle]

But in that case the memory slot (with the address of the kernel32!CloseHandle function) will be only one. We can also group all the memory slots into a continuous linear array (usually not larger than a single memory page in size) and thus we will have to modify only that page during relocation. Such region is called "Import Address Table" (IAT) and it is described in the IMAGE_DIRECTORY_ENTRY_IAT directory in the PE file.

Thus we have to call functions in two different ways:

CALL rel32 - if the function is located inside the module where the call instruction is.
CALL [m] - if the function is located inside a different module.

In this case we are not talking about virtual function calls, where most of the times compiler uses CALL [m] instruction, where [m] points to an address inside a virtual function table. But that is done for the purpose of flexibility and functionality and thus we're not concerned with saving space there.

Also, at times, one may notice the CALL reg instruction in a well optimized code. The compiler may do this in a situation as such:

x86-64[Copy]

call [__imp_CloseHandle]
call [__imp_CloseHandle]
; ...
call [__imp_CloseHandle]

The code above can be optimized to:

x86-64[Copy]

mov rbx, [__imp_CloseHandle]     ; we read once from [__imp_CloseHandle] and place the result into the non-volatile register: RBX
call rbx
call rbx
; ...
call rbx

Thus, to call a function from one module to another we can use different forms of the call instruction. But to pick one, the compiler needs to know where the function that is being called is location. By default the compiler assumes that it is located in the same module, and generates the CALL rel32 instruction.

If we want to tell the compiler that some function is located in another module, there is a special __declspec(dllimport) attribute. When the compiler detects a function declared with it:

Blog Post

Intricacies of Microsoft Compilers - Part 2

The use of __imp_ and __imp_load_ prefixes.

Intro

Types of the CALL instruction

CL.exe Specifics

Delay Load Imports and __imp_load_ Prefix

Conclusion

Social Media

Contact

Related Articles