<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Machine Learning on Statistics @ Home</title>
    <link>http://statsathome.com/tags/machine-learning/</link>
    <description>Recent content in Machine Learning on Statistics @ Home</description>
    <generator>Hugo</generator>
    <language>en-EN</language>
    <managingEditor>stats.at.home@gmail.com (Justin and Rachel Silverman)</managingEditor>
    <webMaster>stats.at.home@gmail.com (Justin and Rachel Silverman)</webMaster>
    <copyright>(c) 2017 Justin and Rachel Silverman</copyright>
    <lastBuildDate>Sun, 13 Aug 2017 00:00:00 +0000</lastBuildDate>
    <atom:link href="http://statsathome.com/tags/machine-learning/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>A New(?) Regression Clustering Algorithm</title>
      <link>http://statsathome.com/2017/08/13/a-new-functional-clustering-algorithm/</link>
      <pubDate>Sun, 13 Aug 2017 00:00:00 +0000</pubDate><author>stats.at.home@gmail.com (Justin and Rachel Silverman)</author>
      <guid>http://statsathome.com/2017/08/13/a-new-functional-clustering-algorithm/</guid>
      <description>&lt;div id=&#34;TOC&#34;&gt;&#xA;&lt;ul&gt;&#xA;&lt;li&gt;&lt;a href=&#34;#motivation&#34;&gt;Motivation&lt;/a&gt;&lt;/li&gt;&#xA;&lt;li&gt;&lt;a href=&#34;#my-solution---hybrid-k-meanslinear-regression-with-transformation&#34;&gt;My Solution - Hybrid K-Means/Linear-Regression with Transformation&lt;/a&gt;&lt;/li&gt;&#xA;&lt;li&gt;&lt;a href=&#34;#starting-on-boring-simulated-data&#34;&gt;Starting on Boring Simulated Data&lt;/a&gt;&lt;/li&gt;&#xA;&lt;li&gt;&lt;a href=&#34;#now-a-more-interesting-simulated-dataset&#34;&gt;Now a more interesting simulated dataset&lt;/a&gt;&lt;/li&gt;&#xA;&lt;li&gt;&lt;a href=&#34;#more-realistic-presense-of-observational-noise&#34;&gt;More realistic, presense of observational noise&lt;/a&gt;&lt;/li&gt;&#xA;&lt;li&gt;&lt;a href=&#34;#conclusions-and-future-directions&#34;&gt;Conclusions and future directions&lt;/a&gt;&lt;/li&gt;&#xA;&lt;/ul&gt;&#xA;&lt;/div&gt;&#xA;&#xA;&lt;div id=&#34;motivation&#34; class=&#34;section level1&#34;&gt;&#xA;&lt;h1&gt;Motivation&lt;/h1&gt;&#xA;&lt;p&gt;I am a fan of the &lt;a href=&#34;https://stackexchange.com/&#34;&gt;Stack Exchange forums&lt;/a&gt;. In particular, I like &lt;a href=&#34;https://stats.stackexchange.com/&#34;&gt;Cross Validated&lt;/a&gt; and &lt;a href=&#34;https://stackoverflow.com/&#34;&gt;Stack Overflow&lt;/a&gt;. An &lt;a href=&#34;https://stats.stackexchange.com/questions/297689/method-to-group-linear-features-in-a-graph/297745#297745&#34;&gt;interesting question regarding clustering&lt;/a&gt; was posted recently. Essentially someone had the following dataset.&lt;/p&gt;&#xA;&lt;pre class=&#34;r&#34;&gt;&lt;code&gt;plot(Retirees)&lt;/code&gt;&lt;/pre&gt;&#xA;&lt;p&gt;&lt;img src=&#34;http://statsathome.com/post/2017-08-13-a-new-functional-clustering-algorithm_files/figure-html/unnamed-chunk-2-1.png&#34; width=&#34;672&#34; /&gt;&lt;/p&gt;&#xA;&lt;p&gt;Essentially the poster wanted a way of clustering the observations into the “lines” that are fairly easy to observe in the data. I am going to ignore the fact that these lines are actually the result of artifact (e.g., conversion of discrete values to percentages and then plotting the percentages vs. a variable used to calculate the percentages) and just pretend they are real as I think its still an interesting problem. I am actually going to simulate some non-artifactual data and use this as well.&lt;/p&gt;</description>
    </item>
  </channel>
</rss>
