August 2024 updates

This commit is contained in:
Natalie Elphick 2024-08-22 14:34:07 -07:00
parent 84821ad2e5
commit e6fca4e130
6 changed files with 324 additions and 269 deletions

File diff suppressed because one or more lines are too long

File diff suppressed because one or more lines are too long

View file

@ -2,7 +2,7 @@
title: "Introduction to R Data Analysis"
subtitle: "Part 1"
author: "Natalie Elphick"
date: "May 20th, 2024"
date: "August 26th, 2024"
knit: (function(input, ...) {
rmarkdown::render(
input,
@ -29,8 +29,8 @@ knitr::opts_chunk$set(comment = "")
**Natalie Elphick**
Bioinformatician I
**Yihang Xin (Online TA)**
Software Engineer III
**Min-Gyoung Shin**
Bioinformatician III
## Poll 1
@ -47,6 +47,24 @@ Software Engineer III
- No background in statistics or computing
- No prior experience with programming or R/RStudio
## Learning Objectives
1. Navigate the RStudio environment and understand how R works
2. Understand variable types and data structures
3. Perform data cleaning and transformation in R
4. Create simple visualizations using ggplot2
## Learning R Takes Time!
- **Workshop Pace**: This is an intro, and its okay if everything doesnt click right away.
- **Practice is Key**: Plan to spend extra time on practicing concepts after the workshop.
- **Self-Guided Learning**: Use the materials provided at the end of the workshop to continue at your own pace.
Keep at it—progress comes with persistence!
## Part 1:
1. What is R and why should you use it?
@ -73,25 +91,20 @@ Software Engineer III
- Can easily implement any statistical analysis
- Code serves as a record which enables reproducibility
with minimal effort
- As of March 2023, there were over 19,000 open source packages to extend its
- As of August 2024, there were over 21,000 open source packages to extend its
functionality
- Highly customizable graphics ([ggplot2](https://ggplot2-book.org/))
- Analysis reports ([knitr](https://cran.r-project.org/web/packages/knitr/index.html))
- RNA-seq analysis ([DESeq2](https://bioconductor.org/packages/release/bioc/html/DESeq2.html))
## How does it work?
<section class="shrink">
![Programming](assets/R_lang_hierarchy.png)
</section>
# RStudio
## RStudio
- RStudio is an integrated development
environment (IDE)
- It is an app that makes R code easier to write by providing a
feature rich graphical user interface (GUI)
- An app that makes R code easier to write by providing a feature rich graphical user interface (GUI)
<br>
</br>
@ -109,7 +122,7 @@ feature rich graphical user interface (GUI)
## File types
- **Rscript** files that end in `.R`
- The most basic, a file that contains R code
- The most basic, a file that contains only R code
- **RMarkdown** files that end in `.Rmd`
- Let's create a blank Rscript to see how they work, open RStudio and click:
- File -\> New File -\> R Script
@ -228,7 +241,7 @@ DogBreeds <- c("Labrador Retriever", "Akita", "Bulldog")
## Data Types
- Integer
- Whole numbers (in R denoted with L ex. 1L,2L)
- Whole numbers (denoted with L ex. 1L,2L)
- Numeric
- Decimal numbers
- Logical
@ -312,7 +325,7 @@ x %in% y # Is x in this vector y?
**What is the output of the following code?**
```{r, eval = FALSE}
4 %in% as.character(c(1,2,3,4))
4 %in% c(1, 2, 3, 4)
```
1. TRUE
@ -352,10 +365,15 @@ execution of code
```{r}
dog_breeds <- c("Labrador Retriever", "Akita", "Bulldog")
if ("Akita" %in% dog_breeds) {
print("dog_breeds already contains Akita")
} else {
dog_breeds <- c("Akita", dog_breeds)
}
```
@ -372,13 +390,12 @@ perform a single action
![Functions](assets/functions.png)
## Defining a function
- To define a function we use the function keyword, the output is specified with the **return** keyword:
- To define a function we use the function keyword, the output is specified with the **return** function:
```{r}
add_dog <- function(dog_to_add,
input_vector) {
add_dog <- function(dog_to_add, input_vector) {
if (dog_to_add %in% input_vector) {
message("Already contains this dog")
print("Already contains this dog")
} else {
@ -453,8 +470,16 @@ packages
## Upcoming Workshops
[Single Cell ATAC-Seq Data Analysis Part 2](https://gladstone.org/events/single-cell-atac-seq-data-analysis-part-2-1)
[Intermediate RNA-Seq Analysis Using R](https://gladstone.org/events/intermediate-rna-seq-analysis-using-r-5)
September 10, 2024 9am-12pm PDT
- Check [this link](https://gladstone.org/events?series=data-science-training-program) at the end of the summer for out fall workshop schedule
[Introduction to Statistics, Experimental Design, and Hypothesis Testing](https://gladstone.org/events/introduction-statistics-experimental-design-and-hypothesis-testing-1)
September 10 - September 12, 2024 1-3pm PDT
[Single Cell RNA-Seq Data Analysis](https://gladstone.org/events/single-cell-rna-seq-data-analysis-0)
September 16-September 17, 2024 9am-4pm PDT
- Check [this link](https://gladstone.org/events?series=data-science-training-program) at for the full schedule

View file

@ -2,7 +2,7 @@
title: "Introduction to R Data Analysis"
subtitle: "Part 2"
author: "Natalie Elphick"
date: "May 21st, 2024"
date: "August 27th, 2024"
knit: (function(input, ...) {
rmarkdown::render(
input,
@ -31,11 +31,11 @@ knitr::opts_chunk$set(comment = "")
**Natalie Elphick**
Bioinformatician I
**Michela Traglia (Online TA)**
**Michela Traglia**
Senior Statistician
**Yihang Xin (Online TA)**
Software Engineer III
**Ayushi Agrawal**
Bioinformatician III
# Schedule
@ -179,7 +179,7 @@ ggplot(data = mpg, # Input dataframe
```{r, fig.dim=c(10,4)}
ggplot(data = mpg,
mapping = aes(x = cty, y = hwy)) +
geom_point() +
geom_point(color = "brown") +
geom_smooth(formula = y ~ x, method = "lm")
```
@ -315,11 +315,17 @@ For any bioinformatics specific questions feel free to reach out to the Gladston
## Upcoming Workshops
[Single Cell ATAC-Seq Data Analysis Part 2](https://gladstone.org/events/single-cell-atac-seq-data-analysis-part-2-1)
[Intermediate RNA-Seq Analysis Using R](https://gladstone.org/events/intermediate-rna-seq-analysis-using-r-5)
September 10, 2024 9am-12pm PDT
- Check [this link](https://gladstone.org/events?series=data-science-training-program) at the end of the summer for out fall workshop schedule
[Introduction to Statistics, Experimental Design, and Hypothesis Testing](https://gladstone.org/events/introduction-statistics-experimental-design-and-hypothesis-testing-1)
September 10 - September 12, 2024 1-3pm PDT
- [Gladstone Bioinformatics Workshops](https://github.com/gladstone-institutes/Bioinformatics-Workshops/wiki) - workshop wiki page for all of the workshops we offer
[Single Cell RNA-Seq Data Analysis](https://gladstone.org/events/single-cell-rna-seq-data-analysis-0)
September 16-September 17, 2024 9am-4pm PDT
- Check [this link](https://gladstone.org/events?series=data-science-training-program) at for the full schedule

File diff suppressed because it is too large Load diff

View file

@ -22,7 +22,6 @@
}
/* Add horizontal scrolling to all code outputs */
.reveal pre code.output {
@ -43,12 +42,12 @@ pre, code, kbd, samp {
/* Bold slide titles and change color */
.reveal h2 {
font-weight: bold !important;
color: #9c0366;
color: #0072B2;
}
/* Bold slide titles and change color */
.reveal h1 {
font-weight: bold !important;
color: #9c0366;
color: #0072B2;
}
.reveal .slides>section:first-child h2 {
color: #333;
@ -57,7 +56,7 @@ pre, code, kbd, samp {
/* Custom slide title */
.my-title-slide h1 {
font-weight: bold;
color: #9c0366;
color: #0072B2;
}
.my-title-slide h2 {
color: #333;
@ -68,7 +67,7 @@ font-weight: normal !important;
.reveal .slides>section:first-child h1 {
font-weight: bold !important;
color: #9c0366;
color: #0072B2;
}
/* Increase the spacing between list items */
@ -136,7 +135,7 @@ small {
color: #0c74dc;
}
/* Change link color to magenta on hover */
/* Change link color to darker blue on hover */
.reveal a:hover {
color: #9c0366 !important;
color: #0072B2 !important;
}