该暑期课题项目包括两部分: 第一部分,基础知识和技能,包括数据分析,建立模型和计算机编程。上课通过Zoom视讯会议,7月8日至8月13日,每周六,周日北京时间晚7-10点,共6周,36小时。 第二部分,8月14日至10月30日,课题研究,通过与学生沟通选定课题(可能的课题方向包括经济,金融,社会学,心理学,医疗卫生,等), 文献调研确定研究方向, 实验设计和数据收集,建立模型并做出详实的数据分析,得出专业性的结果和讨论,撰写,修改并完成英文研究论文,递交学术竞赛,专业出版物及专业会议。
Data Analysis and Modeling Using SAS
Objective:
The purpose of the course is to introduce SAS programming in data analysis and modeling. It aims to meet the basic data manipulation requirement for Biostatistics, Regression and Econometrics classes and healthcare research.
Class Materials:
Text book: The Little SAS Book by LD Delwiche and SJ Slaughter
Outline of Contents:
Section 1: Introduction to SAS (windows, programs, comments)
Key codes: data, proc, “;”, /* */
Practice: Programming tips and pull-down menus use
Section 2: Getting data into SAS
Key codes: datalines, infile+input, proc import
Practice: infile data with missing values in txt file
Section 3: Subsetting data and conditional clauses
Key codes: where, when in, if then do; end
Practice: Dummy variable creation for 4 American regions and continuous variable converted into categorical variable
Section 4: Combining SAS data sets
Key codes: set, merge,