When you see an error code you don’t understand, search for it!
Search for the exact text of your error message (except for any references to filepaths, etc.).
R has millions of users and an incredibly active online community. If you encounter any error message, chances are someone has had to debug that exact error in the past.
Many, many issues will be solved by closing and re-opening R.
Sometimes, things get weird with your global environment, and you just need to start with a clean slate. It can be frustrating, because you don’t know what was causing the issue, but also a relief, because it will be fixed.
Resetting will:
rm(list = ls()
!)When you encounter a bug, try to repeat it. Likely, it came about for a reason.
When you do repeat it, you should repeat it with a minimal reproducible example. This means you should remove as much code and simplify as much data as possible.
Example: did your function break with a large data matrix of real-world data stored within a package? How about if you just use a \(3\times 3\) matrix of \(1\)’s?
For more guidelines, see StackOverflow’s guide to creating minimal reproducible examples.
If you weren’t able to fix a bug the “easy” way (found the solution online or just needed a reset), and you have a minimal reproducible example demonstrating your bug, then you are ready for the really hard part: finding the bug.
Often (not always), once you find a bug, it is rather easy to fix.
traceback()
: call stackSometimes functions(in functions(in functions(in functions(…)))) can get complicated.
Exercise: What will the following code return?
Error in x + 1: non-numeric argument to binary operator
traceback()
: call stackIf you call the traceback()
function immediately after triggering an error, the console will print out a summary of how your program reached that error. This summary is called a call stack.
Error in x + 1: non-numeric argument to binary operator
2: f(x) at #1
1: g("a")
traceback()
: call stackExercise: What will the following code return?
[1] 15
traceback()
: call stackWhat about this code?
[1] 11 12 13
traceback()
: call stackWhat about this code?
Error in d + 10: non-numeric argument to binary operator
Why doesn’t it say error in a + 10
?
traceback()
: call stack5: stop("`d` must be numeric", call. = FALSE)
4: i(c)
3: h(b)
2: g(a)
1: f("a")
traceback()
: call stackRead from bottom to top:
f("a")
g(a)
h(b)
i(c)
i(c)
triggered our error.We now know that our error occurred within i(c)
, but we don’t know where within i(c)
our error occurred.
traceback()
: call stackNote this is done in your Console, not your Editor pane. This is typically not a part of your reproducible workflow, this is you figuring out your own problems until you fix what actually belongs in your workflow.
Once you have the bug fixed, then put the debugged code in your Editor pane. In general, most debugging will be done through the Console.
print(), cat(), str()
: messagesUse print()
statements to test out print()
statements to see the values used during execution.
[1] "a"
Error in d + 10: non-numeric argument to binary operator
This is a quick and easy way to find bugs quickly and should be probably your first step when manually debugging.
print(), cat(), str()
: messagesTake your function and throw in a cat()
statement to see what the function is seeing.
print(), cat(), str()
: messagesWe can similarly use str()
:
chr "a"
Error in d + 10: non-numeric argument to binary operator
More specifically, str()
can be used to see the structure of what the function is seeing. This can often be more informative if the structure is not what you expect (a common source of bugs).
print(), cat(), str()
: messagesThe main downside to these approaches is that it can get messy quickly if you use multiple print statements, and you cannot further investigate the object. However, many bugs come down to typos or other trivial errors, and print statements are often all you need.
browser()
: Interactive debuggerSometimes, it may not be enough to just use print statements and locate a bug. You can get more information and interact with that information using browser()
, an interactive debugger.
Within RStudio, you can also get right to an interactive debugger by clicking Rerun with Debug
.
browser()
: Interactive debuggerAlternatively, we can plug browser()
into our function, as with the print statements.
browser()
: Interactive debuggerAfter you run a function with browser()
, you will be inside of your function with an interactive debugger.
You will know it worked if you see a special prompt: Browse[1]>
.
We can see:
Next, n
: executes the next step in the function. (If you have a variable named n
, use print(n)
to display its value.)
Step into, s
, works like next, but if the next step is a function, it will step into that function so you can explore it interactively. (Easy to get lost!)
Finish, or f
: finishes execution of the current loop or function. Useful to skip over long for loops you don’t need to interact with.
Continue, c
: leaves the debugger and continues execution of the function. Useful if you’ve fixed the bad state and want to check that the function proceeds correctly.
Stop, Q
: exits the debugger and returns to the R prompt.
Enter (no text): repeat the previous interactive debugging command. Defaults to Next n
if you haven’t used any debugging commands yet.
where
: prints stack trace of active calls (the interactive equivalent of traceback
).
ls()
: List the current environment. You can use this to browse everything that the function is able to see at that moment.
str()
, print()
: examine objects as the function sees them.
If you execute the step of a function where the error occurs, you will exit interactive debugging mode, the error message will print, and you will be returned to the R prompt.
This can get annoying, because it stops you from interacting right when you hit the bug.
Strategy:
browser()
and execute until you hit the error. Remember exactly where it occurs.browser()
again and execute again, stopping just before triggering the error.Browse
console to trigger the error without exiting the interactive debugger.s
“Step into” the next function and repeat this process.create_p_norm <- function(p = 2) {
return(
function(x) return(sum(abs(x) ^ p) ^ (1 / p))
)
}
calculate_p_norm <- function(x, p) {
# browser()
norm_fn <- create_p_norm(p)
return(norm_fn(x))
}
calculate_p_norm(1:5, 1)
[1] 15
Within an interactive debugging environment, you can use as.list(environment())
to see what is in the environment.
loop_fun <- function(x) {
# browser()
n <- length(x)
y <- rep(NA, n)
z <- rep(NA, n)
for (i in 1:n) {
y[i] <- x[i] * 2
z[i] <- y[i] * 2
}
return(list("y" = y, "z" = z))
}
loop_fun(1:5)
$y
[1] 2 4 6 8 10
$z
[1] 4 8 12 16 20
$y
[1] 2 0 0 0 0 0 4 0 0 0 0 0 6 0 0 0 0 0 8 0 0 0 0 0 10
$z
[1] 4 0 0 0 0 0 8 0 0 0 0 0 12 0 0 0 0 0 16 0 0 0 0 0 20
In RStudio, you may also use breakpoints by writing your code in an R file. A breakpoint is like a browser()
call, but you avoid having to change your code.
Search online.
Reset.
Repeat.
Locate:
traceback()
print()
, cat()
, str()
browser()
, debug()
, debugonce()
, trace()
A number is a palindrome if it is the same when the digits are reversed (ex. 484). Can you fix the following code to print numbers between 1 and 1000 that are both prime and palindromes?
is_prime <- function(x) {
if (x == 2) {
return(TRUE)
} else if (any(x %% 2:(x-1) == 0)) {
return(FALSE)
} else {
return(TRUE)
}
}
is_palindrome <- function(x) {
x_char <- as.character(x)
x_char <- stringr::str_split(x_char, "")[[1]]
x_rev <- rev(x_char)
return(x_char == x_rev)
}
for (i in 1:1000) {
if (is_prime(i) & is_palindrome(i)) {
print(i)
}
}
[1] 2
[1] 3
[1] 5
[1] 7
Error in if (is_prime(i) & is_palindrome(i)) {: the condition has length > 1