Ocean Heat Content Structure Revealed by Un-Supervised Classification of Hydrographic Profiles

Guillaume Maze1, Herle Mercier2, Ronan Fablet3, Philippe Lenca4, Manuel Lopez Radcenco4, Pierre Tandeo3, Clement Le Goff3 and Charlène Feucher1, (1)IFREMER, LPO, Plouzané, France, (2)CNRS, LOPS, Plouzane, France, (3)Telecom Bretagne, Lab-STICC, Brest, France, (4)Telecom Bretagne, Lab-STICC, France
Abstract:
In the data mining community, unsupervised classification (or clustering) technics are used to reveal and explore the hidden structure of a dataset. Among them, mixture models and in particular Gaussian Mixture Models (GMM), are a very popular tool. With GMM, the statistical distribution of the dataset is decomposed into a weighted sum of Gaussian modes that maximizes the likelihood of the dataset. Multivariate Gaussian distributions characterize a mode in the D-dimensional space of the dataset. This technic is routinely used in atmospheric science to describe weather regimes but is yet to be explored in physical oceanography. Here, we are interested in determining the structure of the North Atlantic ocean temperature field.

Argo profiles in the North atlantic were compressed along their vertical axis in order to reduce the dimensionality of the problem. We then fitted GMMs to this reduced profile dataset according to a Maximum Likelihood criterion using the Expectation-Maximization algorithm. A seven-mode GMM was the most relevant to describe the dataset. Each mode or cluster is characterized by the parameters of a Gaussian distribution (a mean and a covariance matrix). GMM also provides for each profile of the dataset the probability it belongs to a specific cluster. We will show that these informations can be used to describe physically coherent heat reservoirs and their variability.

Indeed, we found that clusters capture the large scale climatological structure of the temperature field. Each of the cluster correspond to physically coherent regions, namely the equatorial, tropical, subtropical, intergyre and subpolar regions and are associated with reference profiles. A hierarchical clustering was applied to characterize the regional variability of the dataset. We will present those clusters and possible use both for scientific analysis of the heat content variability in the North Atlantic and technical validation of the Argo array.