Dope vector


In computer programming, a dope vector is a data structure used to hold information about a data object, especially its memory layout.

Purpose

Dope vectors are most commonly used to describe arrays, which typically store multiple instances of a particular datatype as a contiguous block of memory. For example, an array containing 100 elements, each of which occupies 32 bytes, requires 100 × 32 = 3,200 bytes. By itself, such a memory block has no place to keep track of how large the array is overall, how large each element within it is, or how many elements it contains. A dope vector is a place to store such information. Dope vectors can also describe structures which may contain arrays or variable elements.
If such an array is stored contiguously, with the first byte at memory location M, then its last byte is at location M + 3199. A major advantage of this arrangement is that locating item N is easy: counting items from zero, it begins at location M + (N × 32). Of course, the value 32 (the stride, or size of each element) must be known. Navigating an array data structure using an index is called dead reckoning.
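The address arithmetic for a contiguous array can be sketched in C as follows. This is only an illustration: the function names item_address and last_byte_address are hypothetical, addresses are modelled as plain integers, and items are assumed to be counted from zero.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: address at which item n begins, counting from zero,
   in a contiguous array that starts at base and uses stride bytes per item. */
uintptr_t item_address(uintptr_t base, size_t stride, size_t n)
{
    return base + (uintptr_t)(n * stride);
}

/* Hypothetical helper: address of the final byte of the whole array.
   An empty array has no last byte, so count and stride must be nonzero. */
uintptr_t last_byte_address(uintptr_t base, size_t stride, size_t count)
{
    assert(stride > 0 && count > 0);
    return base + (uintptr_t)(count * stride) - 1;
}
```

With a base of 1000, a stride of 32 and 100 items, item 3 begins at 1096 and the last byte of the array is at 4199. Note that neither function can be evaluated unless the caller supplies the stride and the count from somewhere.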
This arrangement, however, means that having the location of item N is not enough to discover the index N itself, the stride, or whether there are elements at N − 1 or N + 1. For example, a function or method may iterate over all the items in an array and pass each one to another function or method, which does not know the item is part of an array at all, much less where or how large the array is.
Without a dope vector, even knowing the address of the entire array does not tell you how big it is. This is important because writing to element N + 1 of an array that contains only N elements will likely destroy some other data. Because many programming languages treat character strings as a kind of array, this leads directly to the infamous buffer overflow problem.
A dope vector reduces these problems by storing a small amount of metadata along with an array. With dope vectors, a compiler can easily insert code that prevents accidentally writing beyond the end of an array or other object. Alternatively, the programmer can access the dope vector when desired, for safety or other purposes.
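A bounds check of the kind a compiler might insert, or a programmer might write by hand, could look roughly like the C sketch below. The struct layout, its field names, and the function checked_store are all hypothetical, chosen only for illustration; real languages and runtimes define their own descriptor formats.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical minimal dope vector; the field names are illustrative. */
struct dope_vector {
    unsigned char *base;   /* address of the first element     */
    size_t stride;         /* bytes occupied by each element   */
    size_t count;          /* number of elements in the array  */
};

/* Copy one element's worth of bytes from src into slot index.
   Returns 0 on success, or -1 if index lies outside the array,
   in which case nothing is written. */
int checked_store(const struct dope_vector *dv, size_t index, const void *src)
{
    assert(dv != NULL && src != NULL);
    if (index >= dv->count)
        return -1;                      /* refuse to write past the end */
    memcpy(dv->base + index * dv->stride, src, dv->stride);
    return 0;
}
```

Because the element count travels with the array, the out-of-range store is rejected instead of silently overwriting whatever happens to follow the array in memory.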

Description

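The exact set of metadata included in a dope vector varies from one language or operating system to another, but a dope vector for an array might contain, for example, the location of the array's first element, the size of each element (the stride), and the number of elements in the array.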
A program can then refer to the array by referring to the dope vector. This is commonly automatic in high-level languages. Getting to an element of the array costs slightly more, because the access goes through the dope vector. On the other hand, many other common operations become easier, faster, or both.
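As a rough illustration of referring to an array only through its dope vector, the C sketch below passes a descriptor rather than a bare pointer. The struct and the functions element_at, total_bytes and sum_doubles are hypothetical names invented for this example, and the three fields shown are an assumed minimal set rather than any particular language's format.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical minimal dope vector; the field names are illustrative. */
struct dope_vector {
    unsigned char *base;   /* address of the first element     */
    size_t stride;         /* bytes occupied by each element   */
    size_t count;          /* number of elements in the array  */
};

/* Element access costs a little extra: base and stride are first
   loaded from the descriptor, then the usual arithmetic is applied. */
void *element_at(const struct dope_vector *d, size_t i)
{
    assert(i < d->count);
    return d->base + i * d->stride;
}

/* The overall size of the array is available without scanning it. */
size_t total_bytes(const struct dope_vector *d)
{
    return d->stride * d->count;
}

/* A callee can walk the whole array given nothing but the descriptor.
   This example assumes the elements are doubles. */
double sum_doubles(const struct dope_vector *d)
{
    double total = 0.0;
    for (size_t i = 0; i < d->count; i++)
        total += *(const double *)element_at(d, i);
    return total;
}
```

A caller would build the descriptor once, for instance from a local array of doubles together with its element size and length, and hand only the descriptor to sum_doubles; no separate length argument is needed.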
Even with a dope vector, having a pointer to a particular member of an array does not enable finding the position in the array, or the location of the array or the dope vector itself. If that is desired, such information can be added to each element within the array. Such per-element information can be useful, but is not part of the dope vector.
Dope vectors can be a general facility, shared across multiple datatypes.