Kernel selection in PROC SVM The support vector machine (SVM) is a flexible classification or regression method by using its many kernels. To apply a SVM, we possibly need to specify a kernel, a regularization parameter c and some kernel parameters like gamma. Besides the selection of regularization parameter c in my previous post, the SVM procedure and the iris flower data set are used here to discuss the kernel selection in SAS.Exploration of the iris flower dataThe iris data is classic for classification exercise. If we use the first two components from Principle Component Analysis (PCA) to compress the four predictors, petal length, petal width, sepal length, ...
Top 10 most powerful functions for PROC SQL ABSTRACTPROC SQL is not only one of the many SAS procedures and also a distinctive subsystem with all common features from SQL (Structured Query Language). Equipped with PROC SQL, SAS upgrades to a full-fledging relational database management system. PROC SQL provides alternative ways to manage data other than the traditional DATA Step and SAS procedures. In addition, SAS’s built-in functions are the add-on tools to increase the power of PROC SQL. In this paper, we illustrate ten popular SAS functions, which facilitate the capacity of PROC SQL in data management and descriptive statistics.INTRODUCTIONStructured Query Language (SQL) is a universal computer ...
Some popular regression procedures in SAS/STAT With the new release of PROC ADAPTIVEREG in SAS 9.4, the tool belt of regressions in SAS/STAT is almost completed. Hope in the future there will be a designated procedure for k-NN in SAS/STAT.. 
Using SAS to find the best k for k-Nearest Neighbor classification
The KFC toy problem: perspectives from four job roles There is an interesting question —There are 5 different types of toys at a KFC restaurant. If you go there, you will get one toy randomly. How many times do you need to go to KFC in order to get all 5 toys?The question is about probabilistic analysis. Different professionals, such as a business analyst, a statistical programmer, a mathematician and a software developer, will have different thinking pathway to solve this problem. Let's see what they would think.1. Business AnalystA business analyst will tend to do scenario analysis at the first step.Best-case scenario:Assume I am so lucky that each ...