<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Modeling a rare target (used undersample) on SAS EMiner in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Modeling-a-rare-target-used-undersample-on-SAS-EMiner/m-p/823338#M10232</link>
    <description>Hi, I am making various models for rare target data. I have made log. Regression, regression, ensemble, and gradient boosting. When I model compare, I find different chosen models for different “selection statistic” metrics. I am wondering which metric would be best to evaluate to decide my final model for this model use case?&lt;BR /&gt;&lt;BR /&gt;Can I find AUROC on Miner as well?&lt;BR /&gt;&lt;BR /&gt;Also, I am wanting to use random forest model as well, but I get an error when I use 20 samples, apply LARS and partition the data, not sure what the step before the Forest node is (because the old version of miner had a node for this but I am using 14.2 and it does not exist), so I get an error when trying to run the forest. It says “must use at least one input or rejected variable”, and I am not sure how to fix this and get the forest to run. Thanks.</description>
    <pubDate>Thu, 14 Jul 2022 15:05:40 GMT</pubDate>
    <dc:creator>DarioM</dc:creator>
    <dc:date>2022-07-14T15:05:40Z</dc:date>
    <item>
      <title>Modeling a rare target (used undersample) on SAS EMiner</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Modeling-a-rare-target-used-undersample-on-SAS-EMiner/m-p/823338#M10232</link>
      <description>Hi, I am making various models for rare target data. I have made log. Regression, regression, ensemble, and gradient boosting. When I model compare, I find different chosen models for different “selection statistic” metrics. I am wondering which metric would be best to evaluate to decide my final model for this model use case?&lt;BR /&gt;&lt;BR /&gt;Can I find AUROC on Miner as well?&lt;BR /&gt;&lt;BR /&gt;Also, I am wanting to use random forest model as well, but I get an error when I use 20 samples, apply LARS and partition the data, not sure what the step before the Forest node is (because the old version of miner had a node for this but I am using 14.2 and it does not exist), so I get an error when trying to run the forest. It says “must use at least one input or rejected variable”, and I am not sure how to fix this and get the forest to run. Thanks.</description>
      <pubDate>Thu, 14 Jul 2022 15:05:40 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Modeling-a-rare-target-used-undersample-on-SAS-EMiner/m-p/823338#M10232</guid>
      <dc:creator>DarioM</dc:creator>
      <dc:date>2022-07-14T15:05:40Z</dc:date>
    </item>
  </channel>
</rss>

