{"id":2600,"date":"2026-01-01T00:01:43","date_gmt":"2026-01-01T05:01:43","guid":{"rendered":"https:\/\/mathvoices.ams.org\/featurecolumn\/?p=2600"},"modified":"2026-01-02T13:42:42","modified_gmt":"2026-01-02T18:42:42","slug":"information-insight-and-the-problem-with-parameters","status":"publish","type":"post","link":"https:\/\/mathvoices.ams.org\/featurecolumn\/2026\/01\/01\/information-insight-and-the-problem-with-parameters\/","title":{"rendered":"Information, Insight, and the Problem With Parameters"},"content":{"rendered":"<p><span id=\"pullQuote\"><em>Could we possibly gain more insight by trading away even more information?<\/em><\/span><\/p>\n<h1 class=\"headlineText\">Information, Insight, and the Problem With<br \/>\nParameters<\/h1>\n<p><b>Anil Venkatesh<br \/>\nAdelphi University<\/b><\/p>\n<h1 class=\"unnumbered\" id=\"introduction\">Introduction<\/h1>\n<p>I have two data sets for us to consider. Both consist of observations<br \/>\nof a single variable. <a href=\"https:\/\/health.data.ny.gov\/Health\/New-York-State-Statewide-COVID-19-Testing\/jvfi-ffup\/data_preview\">Data Set 1<\/a> holds the daily number of positive<br \/>\nCOVID-19 tests in Kings County, NY, since March 2020. <a href=\"https:\/\/www.kaggle.com\/datasets\/johnjdavisiv\/urinary-biomarkers-for-pancreatic-cancer\">Data Set 2<\/a> holds<br \/>\nthe LYVE1 level of 535 patients\u2014this is the urinalysis concentration of<br \/>\na certain protein that may relate to the metastasis of pancreatic<br \/>\ncancer.<\/p>\n<figure>\n<img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/01_simple-plots.png?resize=600%2C472&#038;ssl=1\" alt=\"Two graphs: daily new covid cases in King County vs. time, and LYVE vs. patient ID. \" width=\"600\" height=\"472\" class=\"aligncenter size-full wp-image-2602\" srcset=\"https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/01_simple-plots.png?w=600&amp;ssl=1 600w, https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/01_simple-plots.png?resize=300%2C236&amp;ssl=1 300w, https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/01_simple-plots.png?resize=465%2C366&amp;ssl=1 465w\" sizes=\"auto, (max-width: 600px) 100vw, 600px\" \/><figcaption>Line plots of Data Sets 1 and 2.<\/figcaption><\/figure>\n<p>One of these plots is a lot more informative than the other. When we<br \/>\nplot observations from left to right, there\u2019s an implied hypothesis that<br \/>\nthe order of the observations has an association with the observations<br \/>\nthemselves. This is clearly true in the case of positive COVID-19 tests<br \/>\nover time, but it\u2019s clearly false in the case of LYVE1 levels<br \/>\nvs.\u00a0patient ID. Even if we can identify a trend in the second plot, it<br \/>\ncan\u2019t possibly carry any insight.<\/p>\n<p>While both data sets ostensibly only measure one variable, the<br \/>\nCOVID-19 data set has an implicit time-ordering that allows us to<br \/>\nvisualize it in two dimensions. With no second variable in the<br \/>\nurinalysis data set, we\u2019re stuck building a one-dimensional scatter plot<br \/>\n(is that a mathematical koan?) We can certainly mark a number line at<br \/>\neach value that\u2019s contained in the data set, but let\u2019s extend these<br \/>\nmarks into vertical lines for readability.<\/p>\n<figure>\n<img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/02_novelty-plot.png?resize=600%2C491&#038;ssl=1\" alt=\"A vertical line for each LYVE level between 0 and 25. The data appears concentrated near 0.\" width=\"600\" height=\"491\" class=\"aligncenter size-full wp-image-2603\" srcset=\"https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/02_novelty-plot.png?w=600&amp;ssl=1 600w, https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/02_novelty-plot.png?resize=300%2C246&amp;ssl=1 300w, https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/02_novelty-plot.png?resize=465%2C381&amp;ssl=1 465w\" sizes=\"auto, (max-width: 600px) 100vw, 600px\" \/><figcaption>Novelty plot (or one-dimensional scatterplot) of the<br \/>\nurinalysis data set.<\/figcaption><\/figure>\n<p>While this visualization of LYVE1 levels no longer implies a spurious<br \/>\nassociation with patient ID, it hasn\u2019t greatly improved our insight<br \/>\nabout the data. As you look at this barcode-like plot, you can\u2019t help<br \/>\nbut allow some of the lines to run together into blobs. Aggregating like<br \/>\nthis does lose some information, but it allows you to gain a little<br \/>\ninsight about the ranges of values of LYVE1 that are most commonly<br \/>\nrepresented in the data set. In jargon:\u00a0aggregating nearby observations<br \/>\ngives an approximate measurement of the <em>density function<\/em> of the<br \/>\nLYVE1 variable. Could we possibly gain more insight by trading away even<br \/>\nmore information?<\/p>\n<figure>\n<img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/03_histograms.png?resize=600%2C595&#038;ssl=1\" alt=\"Histograms with different bin widths. As the bin size grows, the data appears less jagged and seems to approach a steadily decreasing pattern. With a single bin, of course, the histogram is flat.\" width=\"600\" height=\"595\" class=\"aligncenter size-full wp-image-2604\" srcset=\"https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/03_histograms.png?w=600&amp;ssl=1 600w, https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/03_histograms.png?resize=300%2C298&amp;ssl=1 300w, https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/03_histograms.png?resize=150%2C150&amp;ssl=1 150w, https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/03_histograms.png?resize=465%2C461&amp;ssl=1 465w, https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/03_histograms.png?resize=504%2C500&amp;ssl=1 504w\" sizes=\"auto, (max-width: 600px) 100vw, 600px\" \/><figcaption>Histogram representations of LYVE1 (various bin<br \/>\nwidths).<\/figcaption><\/figure>\n<p>A <em>histogram<\/em> does exactly this:\u00a0break up the axis into bins<br \/>\nof equal width, then count the number of observations in each bin and<br \/>\nplot these counts. The choice of bin width has a profound effect on the<br \/>\nvisualization and attendant insights (or lack thereof). While widening<br \/>\nthe bins can smooth out the random jiggles of the observations, when<br \/>\ntaken too far it inevitably smooths out the entire data set into a blob.<br \/>\nAs we saw before, narrowing the bins too aggressively results in a<br \/>\nnearly useless barcode pattern. Is there such thing as a perfect bin<br \/>\nwidth? Students of calculus will recall that any continuous function on<br \/>\na closed and bounded interval attains a maximum somewhere in that<br \/>\ninterval; here, we wish to know if the \u201cinsightfulness\u201d of a histogram<br \/>\n(holistically construed) can be maximized by some certain bin width<br \/>\nbetween 0 and the range of the data. But is this \u201cinsightfulness\u201d a<br \/>\ncontinuous function of bin width? And if so, can we define it concretely<br \/>\nenough to solve the optimization problem?<\/p>\n<p>Much research has been done on the optimal bin width question. As the<br \/>\ndefault option in Microsoft Excel, Scott\u2019s normal reference rule is<br \/>\nsurely the most ubiquitous approach. The rule prescribes a bin width<br \/>\nproportional to $n^{1\/3}$, where $n$ is the size of the data set, and<br \/>\nactually gives an exact constant of proportionality as well. The<br \/>\n\u201dinsightfulness\u201d function in Scott\u2019s rule is the <em>integrated mean<br \/>\nsquare error<\/em> between the histogram and the true distribution of the<br \/>\ndata\u2026or at least, a normal distribution that most closely matches the<br \/>\ndata. Other popular rules make similar assumptions about the properties<br \/>\nof the data distribution, sometimes even recovering the $n^{1\/3}$ rule but with a<br \/>\ndifferent constant of proportionality. In all these cases, the author<br \/>\nproposed a working definition of insightfulness and then made enough<br \/>\nassumptions about the data in order to reach a closed-form rule for bin<br \/>\nwidth. Unfortunately, different assumptions arrive at different<br \/>\nformulas! This means that we are still stuck making a choice.<\/p>\n<h1 class=\"unnumbered\" id=\"the-problem-with-parameters\">The Problem With<br \/>\nParameters<\/h1>\n<p>A mathematical model that requires the user to choose a value (like<br \/>\nbin width) is called a <em>parametric<\/em> model, i.e.\u00a0involving one or<br \/>\nmore undetermined parameters. Every time we choose the value of a<br \/>\nparameter, we influence the model and potentially change the nature of<br \/>\nthe insight that it provides. This means that every parametric model has<br \/>\nthe user\u2019s thumb on the scale\u2014a problem if we want to gain unbiased<br \/>\ninsight from data. The fewer parameter values we have to pick, the less<br \/>\nour actions will bias the results, right? In what circumstances can we<br \/>\nopt out of choosing a parameter value entirely?<\/p>\n<p>Before we go any further, I have some bad news about the parameter<br \/>\nspace of histograms. While we\u2019ve been doing our best to pick a good bin<br \/>\nwidth, there\u2019s been another parameter lurking in the shadows. What about<br \/>\nsliding all the bins left or right? This \u201cbin offset\u201d parameter can<br \/>\nclearly affect the visualization as observations hop from one bin to<br \/>\nanother. One option is to place the bins arbitrarily and hope that the<br \/>\nresult is good enough, but let\u2019s consider a way of entirely opting out<br \/>\nof this parameter choice. With apologies to <a href=\"https:\/\/www.antoinebuteau.com\/lessons-from-george-polya\/\">P\u00f3lya<\/a>, I humbly<br \/>\npropose:\u00a0<em>When you have two things to say, instead say infinitely<br \/>\nmany things and take the average.<\/em><\/p>\n<figure>\n<img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/04_averaged-histogram.png?resize=600%2C470&#038;ssl=1\" alt=\"The offset-averaged curve has various local maxima and minima, but generally seems to be decreasing as LYVE value increases.\" width=\"600\" height=\"470\" class=\"aligncenter size-full wp-image-2605\" srcset=\"https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/04_averaged-histogram.png?w=600&amp;ssl=1 600w, https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/04_averaged-histogram.png?resize=300%2C235&amp;ssl=1 300w, https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/04_averaged-histogram.png?resize=465%2C364&amp;ssl=1 465w\" sizes=\"auto, (max-width: 600px) 100vw, 600px\" \/><figcaption>Histogram of LYVE1 together with its offset-averaged<br \/>\ncurve.<\/figcaption><\/figure>\n<p>By averaging histograms across all possible bin offsets, we<br \/>\nsuccessfully correct for much of the edge effects inherent to the<br \/>\nhistogram. If a cluster of observations happens to land right on the<br \/>\nedge of a bin, this may substantially distort the histogram but it won\u2019t<br \/>\nupset the offset-averaged curve. While we built this curve by thinking<br \/>\nabout sliding the histogram bins around, it can be constructed<br \/>\nequivalently by defining a triangular function of height 1 and base of 2<br \/>\nbin widths, then centering a copy of this function at each data point<br \/>\nand adding everything up (that\u2019s a nice exercise to work out!)<br \/>\nFormulated this way, we\u2019ve actually described an example of <em>kernel<br \/>\ndensity estimation<\/em> (KDE). A <em>kernel<\/em> is a family of<br \/>\nfunctions that smoothly transitions from an infinitesimally narrow spike<br \/>\nto a wide, flat form. Generally, we want the integral of the kernel to<br \/>\nremain constant throughout this transition. This makes the kernel a<br \/>\nmodel of diffusion of mass or heat from a point source to the<br \/>\nsurrounding environment.<\/p>\n<figure>\n<img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/05_triangle-kernel.png?resize=600%2C470&#038;ssl=1\" alt=\"Isosceles triangles with the same area may have a wide base and shallow peak, or a narrow base and a high peak\" width=\"600\" height=\"470\" class=\"aligncenter size-full wp-image-2606\" srcset=\"https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/05_triangle-kernel.png?w=600&amp;ssl=1 600w, https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/05_triangle-kernel.png?resize=300%2C235&amp;ssl=1 300w, https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/05_triangle-kernel.png?resize=465%2C364&amp;ssl=1 465w\" sizes=\"auto, (max-width: 600px) 100vw, 600px\" \/><figcaption>Five members of a triangle kernel family of mass<br \/>\n1.<\/figcaption><\/figure>\n<p>In kernel density estimation, we center an identical kernel at each<br \/>\ndata point and then allow all the kernels to begin transitioning,<br \/>\nintermingling with each other as they collectively disperse their mass.<br \/>\nWhen configured well, KDE yields smooth, highly interpretable density<br \/>\ncurves. It is implemented by Python\u2019s seaborn library as a standalone<br \/>\nfunction and an optional argument when forming a histogram, and by a<br \/>\nbuilt-in function in R. As the underlying concept of heat diffusion<br \/>\nworks in any ambient dimension, KDE can also be used on multidimensional<br \/>\ndata and is particularly good for heatmap visualizations over two<br \/>\npredictor variables.<\/p>\n<figure>\n<img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/06_kde.png?resize=600%2C655&#038;ssl=1\" alt=\"Six graphs showing that as the diffusion time increases, the kernel density curve becomes smoother and smoother, until at last it&#039;s close to a horizontal line\" width=\"600\" height=\"655\" class=\"aligncenter size-full wp-image-2607\" srcset=\"https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/06_kde.png?w=600&amp;ssl=1 600w, https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/06_kde.png?resize=275%2C300&amp;ssl=1 275w, https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/06_kde.png?resize=465%2C508&amp;ssl=1 465w, https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/06_kde.png?resize=458%2C500&amp;ssl=1 458w\" sizes=\"auto, (max-width: 600px) 100vw, 600px\" \/><figcaption>Kernel density estimates of LYVE1 (various diffusion<br \/>\ntimes).<\/figcaption><\/figure>\n<p>Depending on how long we allow the diffusion process to run, a<br \/>\nvariety of KDE models can result (and they\u2019re suspiciously reminiscent<br \/>\nof the variety of histograms we saw before.) Clearly, it all comes down<br \/>\nto when we choose to hit \u201cpause.\u201d If we stop too soon, the kernels will<br \/>\nnot have had enough time to intermingle and a sparse, jagged form will<br \/>\nresult. If we stop too late, the kernels will have distributed their<br \/>\nmass into one big blob (or even worse, a big flat pancake). Just as with<br \/>\nbin width optimization, there is a body of literature on the optimal<br \/>\ndispersion time in KDE. However, another more serious problem<br \/>\narises:\u00a0the optimal dispersion time depends on the local density of the<br \/>\ndata. In portions of the data set with many closely-packed data points,<br \/>\nthe kernels intermingle rapidly to form one dense clump; by comparison,<br \/>\nthe kernels of isolated data points take a lot longer to intermingle<br \/>\nwith the others. Generally speaking, dispersion time should be longer<br \/>\nwherever the data set is sparse, and shorter wherever the data set is<br \/>\ndense. But\u2026I thought the whole point of KDE was to estimate the density<br \/>\nof the data! If the optimal KDE algorithm requires prior knowledge of<br \/>\nthe density of the data, then what are we even doing here?<\/p>\n<figure>\n<img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/07_attribute-time-kde.png?resize=600%2C603&#038;ssl=1\" alt=\"The heatmap has narrow streaks of white and magenta near the bottom that slowly merge to swathes of near-uniform shades of white or pink at the top\" width=\"600\" height=\"603\" class=\"aligncenter size-full wp-image-2608\" srcset=\"https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/07_attribute-time-kde.png?w=600&amp;ssl=1 600w, https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/07_attribute-time-kde.png?resize=150%2C150&amp;ssl=1 150w, https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/07_attribute-time-kde.png?resize=465%2C467&amp;ssl=1 465w, https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2026\/01\/07_attribute-time-kde.png?resize=498%2C500&amp;ssl=1 498w\" sizes=\"auto, (max-width: 600px) 100vw, 600px\" \/><figcaption>Illustration of mixing time depending on local<br \/>\ndensity.<\/figcaption><\/figure>\n<p>Each row of this heatmap is a stage of KDE, normalized by its maximum<br \/>\nvalue. Reading from the bottom, we see how the white (high-density)<br \/>\nkernels disperse and gradually merge with surrounding regions. Note that<br \/>\ntight clusters of kernels merge together much faster than isolated<br \/>\nkernels. This effect is particularly pronounced in data sets with many<br \/>\nrepeated values, a common feature of medical and sociological data<br \/>\nbecause of the prevalence of ordinal, non-numeric attributes<br \/>\n(e.g.\u00a0Likert scale, Apgar score).<\/p>\n<h1 class=\"unnumbered\" id=\"in-search-of-insight\">In Search of<br \/>\nInsight<\/h1>\n<p>In principle, the optimal KDE model might need a distinct<br \/>\ndiffusion-rate parameter <em>for each data instance<\/em>. What started<br \/>\nas an effort to remove an unwanted parameter has now saddled us with<br \/>\nmore parameters than we know what to do with! This brings us all the way<br \/>\nback to the humble histogram with its bin width parameter, together with<br \/>\na light touch of KDE to smooth out the offset bias. If there\u2019s a moral<br \/>\nhere, it\u2019s this:\u00a0to gain insight, you must lose information.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Could we possibly gain more insight by trading away even more information? Information, Insight, and the Problem With Parameters Anil Venkatesh Adelphi University Introduction I have two data sets for us to consider. Both consist of observations of a single variable. Data Set 1 holds the daily number of positive<span class=\"more-link\"><a href=\"https:\/\/mathvoices.ams.org\/featurecolumn\/2026\/01\/01\/information-insight-and-the-problem-with-parameters\/\">Read More &rarr;<\/a><\/span><\/p>\n","protected":false},"author":2,"featured_media":2037,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"advanced_seo_description":"","jetpack_seo_html_title":"","jetpack_seo_noindex":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_post_was_ever_published":false},"categories":[213,109,25],"tags":[214,215,153],"class_list":["entry","author-uwhitcher","post-2600","post","type-post","status-publish","format-standard","has-post-thumbnail","category-213","category-anil-venkatesh","category-probability-and-statistics","tag-data-visualization","tag-kernel-density-estimation","tag-statistics"],"jetpack_featured_media_url":"https:\/\/i0.wp.com\/mathvoices.ams.org\/featurecolumn\/wp-content\/uploads\/sites\/2\/2024\/07\/cropped-FC1380x500x2.png?fit=1380%2C288&ssl=1","jetpack_sharing_enabled":true,"jetpack_likes_enabled":true,"_links":{"self":[{"href":"https:\/\/mathvoices.ams.org\/featurecolumn\/wp-json\/wp\/v2\/posts\/2600","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mathvoices.ams.org\/featurecolumn\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mathvoices.ams.org\/featurecolumn\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mathvoices.ams.org\/featurecolumn\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/mathvoices.ams.org\/featurecolumn\/wp-json\/wp\/v2\/comments?post=2600"}],"version-history":[{"count":6,"href":"https:\/\/mathvoices.ams.org\/featurecolumn\/wp-json\/wp\/v2\/posts\/2600\/revisions"}],"predecessor-version":[{"id":2615,"href":"https:\/\/mathvoices.ams.org\/featurecolumn\/wp-json\/wp\/v2\/posts\/2600\/revisions\/2615"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/mathvoices.ams.org\/featurecolumn\/wp-json\/wp\/v2\/media\/2037"}],"wp:attachment":[{"href":"https:\/\/mathvoices.ams.org\/featurecolumn\/wp-json\/wp\/v2\/media?parent=2600"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mathvoices.ams.org\/featurecolumn\/wp-json\/wp\/v2\/categories?post=2600"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mathvoices.ams.org\/featurecolumn\/wp-json\/wp\/v2\/tags?post=2600"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}