Beyond the simple wear that check standards can experience, the mass of even the carefully stored national prototypes can drift relative to the IPK for a variety of reasons, some known and some unknown. Since the IPK and its replicas are stored in air (albeit under two or more nested bell jars), they gain mass through adsorption of atmospheric contamination onto their surfaces. Accordingly, they are cleaned in a process the BIPM developed between 1939 and 1946 known as "the BIPM cleaning method"[27] that comprises firmly rubbing with a chamois soaked in equal parts ether and ethanol, followed by steam cleaning with bi-distilled water, and allowing the prototypes to settle for 7–10 days before verification. Before the BIPM's published report in 1994 detailing the relative change in mass of the prototypes, different standard bodies used different techniques to clean their prototypes. The NIST's practice before then was to soak and rinse its two prototypes first in benzene, then in ethanol, and to then clean them with a jet of bi-distilled water steam. Cleaning the prototypes removes between 5 and 60 μg of contamination depending largely on the time elapsed since the last cleaning. Further, a second cleaning can remove up to 10 μg more. After cleaning—even when they are stored under their bell jars—the IPK and its replicas immediately begin gaining mass again. The BIPM even developed a model of this gain and concluded that it averaged 1.11 μg per month for the first 3 months after cleaning and then decreased to an average of about 1 μg per year thereafter. Since check standards like K4 are not cleaned for routine calibrations of other mass standards—a precaution to minimise the potential for wear and handling damage—the BIPM's model of time-dependent mass gain has been used as an "after cleaning" correction factor.
Because the first forty official copies are made of the same alloy as the IPK and are stored under similar conditions, periodic verification using a number of replicas—especially the national primary standards, which are rarely used—can convincingly demonstrate the stability of the IPK. What has become clear after the third periodic verification performed between 1988 and 1992 is that masses of the entire worldwide ensemble of prototypes have been slowly but inexorably diverging from each other. It is also clear that the IPK lost perhaps 50 μg of mass over the last century, and possibly significantly more, in comparison to its official copies.[17][28] The reason for this drift has eluded physicists who have dedicated their careers to the SI unit of mass. No plausible mechanism has been proposed to explain either a steady decrease in the mass of the IPK, or an increase in that of its replicas dispersed throughout the world.[Note 5][29][30][31] Moreover, there are no technical means available to determine whether or not the entire worldwide ensemble of prototypes suffers from even greater long-term trends upwards or downwards because their mass "relative to an invariant of nature is unknown at a level below 1000 μg over a period of 100 or even 50 years".[28] Given the lack of data identifying which of the world's kilogram prototypes has been most stable in absolute terms, it is equally valid to state that the first batch of replicas has, as a group, gained an average of about 25 μg over one hundred years in comparison to the IPK.[Note 6]
What is known specifically about the IPK is that it exhibits a short-term instability of about 30 μg over a period of about a month in its after-cleaned mass.[32] The precise reason for this short-term instability is not understood but is thought to entail surface effects: microscopic differences between the prototypes' polished surfaces, possibly aggravated by hydrogen absorption due to catalysis of the volatile organic compounds that slowly deposit onto the prototypes as well as the hydrocarbon-based solvents used to clean them.[31][33]
It has been possible to rule out many explanations of the observed divergences in the masses of the world's prototypes proposed by scientists and the general public. The BIPM's FAQ explains, for example, that the divergence is dependent on the amount of time elapsed between measurements and not dependent on the number of times the prototype or its copies have been cleaned or possible changes in gravity or environment.[34] Reports published in 2013 by Peter Cumpson of Newcastle University based on the X-ray photoelectron spectroscopy of samples that were stored alongside various prototype kilograms suggested that one source of the divergence between the various prototypes could be traced to mercury that had been absorbed by the prototypes being in the proximity of mercury-based instruments. The IPK has been stored within centimetres of a mercury thermometer since at least as far back as the late 1980s.[35] In this Newcastle University work six platinum weights made in the nineteenth century were all found to have mercury at the surface, the most contaminated of which had the equivalent of 250 μg of mercury when scaled to the surface area of a kilogram prototype.