This question is similar to my prior thread but also a bit different -- and I don't want to get flagged for spam for resurrecting an old thread. I also would like to post this on stackoverflow, hope that doesn't offend anyone.

I have some Javascript extracting a Program Date Time (PDT) value, first_pdt, from a livestream and posting to PHP via AJAX:

$first_pdt = isset($_POST['first_pdt']) ? $_POST['first_pdt'] : NULL;

$first_pdt should have an ISO 8601 format, probably with fractional seconds like so:

2021-09-21T22:19:51.689+00:00

Note a) the fractional seconds and b) the timezone offset.

I need to a) store this in a db such that no time conversions occur (it's a timestamp FFS) and b) periodically compare the db stored value to a new, incoming value from a subsequent POST. This is deceptively difficult.

For starters, strtotime() ignores the fractional seconds so I have bothered to cook up a bit of regex and round code to round it to this before any db inserts or comparisons:

2021-09-21T22:19:52+00:00

Next, I want to insert this value into the database unmolested by any timezone conversions. The MySQL docs say that:

In MySQL 8.0.19 and later, you can specify a time zone offset when inserting a TIMESTAMP or DATETIME value into a table.

This strongly suggests to me that supplying the time zone offset will short-circuit any conversion attempts and telegraph to MySQL that I'm talking about the time in some specific time zone. I tried a bit of experimentation. I've defined this table on my workstation (PDT time) and a server (UTC time), each of which is running MySQL server:

mysql> explain wp_video_tracking;
+--------------+------------+------+-----+-------------------+-------------------+
| Field        | Type       | Null | Key | Default           | Extra             |
+--------------+------------+------+-----+-------------------+-------------------+
| id           | bigint     | NO   | PRI | NULL              | auto_increment    |
| created      | datetime   | NO   |     | CURRENT_TIMESTAMP | DEFAULT_GENERATED |
| last_updated | datetime   | NO   |     | CURRENT_TIMESTAMP | DEFAULT_GENERATED |
| user_id      | bigint     | NO   | MUL | NULL              |                   |
| video_id     | bigint     | NO   | MUL | NULL              |                   |
| watch_data   | text       | NO   |     | NULL              |                   |
| pdt_method   | varchar(5) | YES  |     | NULL              |                   |
| first_pdt    | datetime   | YES  |     | NULL              |                   |

And I've inserted a record on both machines:

INSERT INTO wp_video_tracking (user_id, video_id, watch_data, pdt_method, first_pdt) VALUES (3, 3, 'foo', 'bar', '2021-09-21T22:19:52+00:00');

And then I've connected to each machine via SSH, opened a mysql client session via CLI, and queried for the new record. My workstation (PDT time):

mysql> SELECT * FROM wp_video_tracking WHERE user_id=3 AND video_id=3;
+----+---------------------+---------------------+---------+----------+------------+------------+---------------------+
| id | created             | last_updated        | user_id | video_id | watch_data | pdt_method | first_pdt           |
+----+---------------------+---------------------+---------+----------+------------+------------+---------------------+
|  8 | 2021-09-30 10:29:40 | 2021-09-30 10:29:40 |       3 |        3 | foo        | bar        | 2021-09-21 15:19:52 |
+----+---------------------+---------------------+---------+----------+------------+------------+---------------------+

and the server (UTC time):

mysql> SELECT * FROM wp_video_tracking WHERE user_id=3 AND video_id=3;
+----+---------------------+---------------------+---------+----------+------------+------------+---------------------+
| id | created             | last_updated        | user_id | video_id | watch_data | pdt_method | first_pdt           |
+----+---------------------+---------------------+---------+----------+------------+------------+---------------------+
|  2 | 2021-09-30 17:30:50 | 2021-09-30 17:30:50 |       3 |        3 | foo        | bar        | 2021-09-21 22:19:52 |
+----+---------------------+---------------------+---------+----------+------------+------------+---------------------+

You'll note the first_pdt values differ by the difference between the two timezones. This is encouraging inasmuch as I think both mysql databases have captured the exact same UTC timestamp for first_pdt, just interpreted differently because of the different timezones. This query returns the same exact value on both machines:

mysql> SELECT UNIX_TIMESTAMP(first_pdt) FROM wp_video_tracking WHERE user_id=3 AND video_id=3;
+---------------------------+
| UNIX_TIMESTAMP(first_pdt) |
+---------------------------+
|                1632262792 |
+---------------------------+

It's discouraging, though, because when I fetch this first_pdt value, the machine's time zone results in a conversion being applied. I suspect that the actual db engine stores an int or something, and the SELECT behavior of the mysql client -- the closest I could get to the actual db -- is to convert this to a string for User Friendliness™.

It is with growing horror that I realize that the conversion might depend on any number of settings:

  • Ubuntu system time
  • PHP.ini default time zone
  • Wordpress timezone setting
  • WP timezone either specified when db connections is made (Denver!) or NOT applied (UTC?)

And still more horror when I realize these values might have some TZ conversion applied when I do any of the following:

  • change mysql timezone setting
  • mysqldump the data and reload into a future mysql server
  • daylight savings time?

For my current application, if I stick a $first_pdt of 2021-09-21T22:19:52+00:00 into the db, I'd like to see that value returned when I retrieve the record, but there doesn't appear to be any time zone data stored in each db record, and, unlike the PHP date() function, the MySQL DATE_FORMAT function doesn't have any time zone specifiers . Unless I' missing something, one would have to cook up some kind of fancy CONCAT query to get the correct timezone offset and this seems like it would entail a time zone conversion. I'm considering storing it as varchar to avoid any conversions at all, but that precludes all the useful date range functions and indexing I might want to use for sorting/limiting/formatting this column.

In PHP, I need to merge an incoming first_pdt value and its associated data array against the previously stored first_pdt and its data array and merge the two arrays -- the first_pdt values are offsets for each data payload and must be compared to align each element of the data payload. That being the case, I'll need to convert both POST and DB first_pdt values to integers for this alignment & merge. As you can imagine, any rounding errors could cause an off-by-one drift as records are repeatedly updated from incoming POST and timezone corrections could introduce far more extreme alignment problems with the data payloads.

I'm gradually starting to think I might be able to avoid conversions on storage because of the extra time zone offset and then avoid conversions on SELECT by casting or by using UNIX_TIMESTAMP. Any other approach like CONVERT_TZ or other time zone specific conversion seems like it might break because of the many settings that might affect this. I'd like my code to work regardless of any time zone settings on client, server, PHP, or mysql.

Can anyone suggest a simple approach to storing a UTC timestamp which provides the convenient formatting and ordinal treatment of mysql timestamp/datetime fields but which avoids unwanted conversion?

I admit I haven't read your entire post, but I don't see any mention of using a DateTime object; that retains the stamp's components as individual fields (including timezone and fractional seconds), parsing the string based on a specific format ('Y-m-d\TH:i:s.vP' in this case, or DateTimeInterface::RFC3339_EXTENDED). Timezones are taken into account when comparing DateTimes, so that 2021-09-21T22:19:51.689+0:00 and 2021-09-21T23:19:51.689+1:00 refer to the same instant. even though their hour and tz components differ. Calling setTimezone() on a DateTime object does what it says without changing the instant the object refers to (i.e. changes to the timezone are mirrored in the other components).

And I don't know enough about MySQL's date/time types to know how it copes with timezone information (apparently it doesn't; the standard SQL type is timestamp with time zone but standard SQL's timezone handling is problematic: timeless dates can't have timezones, but dateless times can even though timezone behaviour is date-dependent due to DST ... they're not really timezones as people use the things but specific UTC offsets, but anyway: ), now that the object distinguishes those fields, you can combine and format them however is necessary, possibly normalising them to a single (UTC, probably) timezone for storage.

sneakyimp That being the case, I'll need to convert both POST and DB first_pdt values to integers for this alignment & merge.

The timestamp format you're using has, varying timezones notwithstanding, the property that the timestamps sort as strings into the same order that they sort chronologically. RFC3339 §5.1

Weedpacket admit I haven't read your entire post, but I don't see any mention of using a DateTime object;

I've used this before, and appreciate how it effectively wraps up the shockingly confusing and difficult issue of dealing with dates and times and timezones into a tidy and useful package. However, the pain point seems to be that these values go a MySQL database, which seems hell-bent on converting them (unless you define columns as integers or strings) and storing them without timezone information and then, when they come back, they show up in one's PHP code as strings.

I am having some luck by:
1) always supplying a full timestamp with timezone to MySQL when inserting. If a conversion must happen, it's better to be explicit about the time zone rather than hoping that OS, PHP, MySQL client, MySQL server, etc. all agree as to what time zone should be applied.
2) always use the UNIX_TIMESTAMP() function when retrieving these timestamp or datetime values. Unless I'm mistaken, this usually returns an int, unmolested by any time zone conversions. Sadly, you can't hand a PHP DateTime object to the database.

Weedpacket timeless dates can't have timezones, but dateless times can even though timezone behaviour is date-dependent due to DST ... they're not really timezones as people use the things but specific UTC offsets, but anyway: ),

I am shocked anew every time I deal with timestamp-sensitive behavior at how complicated time is. Time zones, leap seconds, DST, etc. E.g., all of China has only one time zone.

Weedpacket The timestamp format you're using has, varying timezones notwithstanding, the property that the timestamps sort as strings into the same order that they sort chronologically

This is a helpful observation. I'm still in a bit of a muddle about why I might want timestamp or datetime fields in my DB. It looks like display convenience is one reason for wanting them. MySQL CLI client displays timestamp columns as human-readable dates, which is nice. There's also DATE_ADD and all that sort of thing. A year is not always 365 days nor can you multiply some large number of days by 86400 and hope for the best. I suspect that using MySQL's timestamp and datetime column might offer a lot of the helpful abstraction provided by the PHP DateTime class.

    Like I said: I'm not familiar enough with MySQL.

    Keep in mind that 'timezone' is basically a matter of display, not a property of the timestamp itself. Any internal date/time arithmetic would first convert from the timestamp's internal storage to the current timezone (in the database, that's whatever the database is configured for, in PHP that's whatever PHP is configured for; in both cases it can be set on a per-session basis) and then afterwards converted back again. And that back-and-forth conversion is to handle things like crossing offset changes.

    Like I can go to my db and type:

    SELECT TIMESTAMP WITH TIME ZONE '2004-10-19 10:23:54+02'
    > [timestamp with time zone] "2004-10-19 21:23:54+13"
    
    SELECT TIMESTAMP WITH TIME ZONE '2004-10-19 10:23:54+02' AT TIME ZONE 'Asia/Singapore'
    > [timestamp without time zone] "2004-10-19 16:23:54"
    
    SELECT TIMESTAMP WITH TIME ZONE '2004-10-19 10:23:54+02' AT TIME ZONE 'UTC'
    > [timestamp without time zone] "2004-10-19 08:23:54"

    In the first case, the timezone was included because it came from the database's own configuration and isn't implied by anything in the query. The user may not have had this information.

    In any case, the +02 offset never got stored. And it should go without saying that all four of those strings do represent the same moment.

    In any case, the timezone is not part of the timestamp. The fact that PostgreSQL's epoch is midnight in the UTC timezone is ultimately as arbitrary as the fact that it's in 4713BC.

    Sadly, you can't hand a PHP DateTime object to the database.

    Which is why you can use its format() method to create a string form.

    and then, when they come back, they show up in one's PHP code as strings.

    Like any other nontrivial type. Fortunately, the DateTime class has a method for constructing instances from such strings.

    Write a Reply...