We could definitely agree that time is the fourth dimension, if we wanted to. It's only a logical way of seeing it.
After all, time is correlated with space, that's how our brains measure it, and that's how clocks measure time, using motion. In other words, we trust the constant motion of a clock just as we trust a ruler to keep its size constant.
I get confused when I read that time moves slower at higher velocities. Because time is measured with motion, something must move, and it must do so reliably in order to be able to log time.
If a ruler changed its size, does that mean centimeters got bigger? No, it means the ruler got bigger.
If a clock for whatever reason moves slower (gaining mass for some reason, or battery running low) all it means is that its motion is being restricted for some physical reason. Not because its offsetting in time.
I don't see the logic in things offsetting in time, that's not how we agreed to correlate time and space. Either we need a new type of measurement called different than "time", or just agree time and space are correlated directly.
Anyway, just as we use a ruler's length to calculate distance, we use a clock's motion to calculate time. Meaning that's what time is! The motion of things.
We could agree for just a moment that we're going to take temperature as the 1st dimension, and time as the 2nd dimension. We can definitely agree on that, if we wanted to. We'd get a graph of temperature varying over time!
But the difference is that when we agree to measure space in three dimensions, it's a correlation we're making between three dimensions of the same type, which is distance (distance x distance x distance).
Distance alone is a single dimension, right? But if we agree on correlate a single distance dimension with another one, we have two distance dimensions, logically speaking. We can correlate one single value of the first dimension with another value of second dimension, that means we get a point in 2D space.
Our brains naturally respond to this correlation visually, because that's how our sense of sight works. There's no trouble comprehending this logic of 2 dimensions.
Funny thing is, we just correlated two dimensions of the SAME unit type (distance). And we can even do it one more time, adding a 3rd dimension of the same unit type.
The correlation between three dimensions, logically speaking is a representation of space.
But what about a 4th dimension of the same unit type? Can we correlate a point in space, with yet a new dimension of distance?
The truth is that when we use the sense of sight, we're not even perceiving the world as a 3D space. Both our eyes gather 2D information. Little hints like (depth of field blurring, and eye separation), even parallax effects on motion grants our brain enough info to interpret the world in this manner (3D), but it's only after the brain processed it.
To us it wouldn't make much sense to correlate 3D space with yet another distance dimension, it's not the way our brain works!
We are mass, hence we're energy, and so is our brain. And we never chose how our brains would interact with the rest of the universe. In fact the sense that our brain is mass "separated" with the rest of the universe is a fabrication of our thoughts!
In the biological sense, using 4 dimensions is not how our brain works, we don't perceive the world that way. I don't think there's a single living creature on Earth capable of actually working that way.
But humans are logical creatures. Just as we can correlate temperature with time. We can choose to correlate 3D space with time, if we decide to.
And that is in fact useful to our brains and its biology, because that's how the brain works. Constantly receiving information, processing it and more importantly STORING it, this gives us a perception of time.
Just like a clock can tell us the time, our brain, biologically speaking, can tell us the time as well (even if it's not as reliable as a clock).
So yeah, we can perfectly agree that time is an extra dimension. Except it's not a dimension of distance, we use a time dimension.
Meaning we can correlate all space to time. Which gives us a 4D representation, of something our brain is not made to interpret visually, yet we're perfectly able to understand.
Even if we tried to picture 4 dimensions visually, we can only work with the tools our brain gives us.
We naturally interpret it as motion. Or we could picture as a 3D still, of everything overlapping, too!
In fact, the whole universe, since the beginning of time, and to its end, can be interpreted as this big overlapped 3D volume, illustrating everything that happened since the beginning of time in a single shape. Needless to say only god can perceive something so complex!