src/librustc/dep_graph/README.md

   1 # Dependency graph for incremental compilation
   2
   3 This module contains the infrastructure for managing the incremental
   4 compilation dependency graph. This README aims to explain how it ought
   5 to be used. In this document, we'll first explain the overall
   6 strategy, and then share some tips for handling specific scenarios.
   7
   8 The high-level idea is that we want to instrument the compiler to
   9 track which parts of the AST and other IR are read/written by what.
  10 This way, when we come back later, we can look at this graph and
  11 determine what work needs to be redone.
  12
  13 ### The dependency graph
  14
  15 The nodes of the graph are defined by the enum `DepNode`. They represent
  16 one of three things:
  17
  18 1. HIR nodes (like `Hir(DefId)`) represent the HIR input itself.
  19 2. Data nodes (like `ItemSignature(DefId)`) represent some computed
  20    information about a particular item.
  21 3. Procedure notes (like `CoherenceCheckImpl(DefId)`) represent some
  22    procedure that is executing. Usually this procedure is
  23    performing some kind of check for errors. You can think of them as
  24    computed values where the value being computed is `()` (and the
  25    value may fail to be computed, if an error results).
  26
  27 An edge `N1 -> N2` is added between two nodes if either:
  28
  29 - the value of `N1` is used to compute `N2`;
  30 - `N1` is read by the procedure `N2`;
  31 - the procedure `N1` writes the value `N2`.
  32
  33 The latter two conditions are equivalent to the first one if you think
  34 of procedures as values.
  35
  36 ### Basic tracking
  37
  38 There is a very general strategy to ensure that you have a correct, if
  39 sometimes overconservative, dependency graph. The two main things you have
  40 to do are (a) identify shared state and (b) identify the current tasks.
  41
  42 ### Identifying shared state
  43
  44 Identify "shared state" that will be written by one pass and read by
  45 another. In particular, we need to identify shared state that will be
  46 read "across items" -- that is, anything where changes in one item
  47 could invalidate work done for other items. So, for example:
  48
  49 1. The signature for a function is "shared state".
  50 2. The computed type of some expression in the body of a function is
  51    not shared state, because if it changes it does not itself
  52    invalidate other functions (though it may be that it causes new
  53    monomorphizations to occur, but that's handled independently).
  54
  55 Put another way: if the HIR for an item changes, we are going to
  56 recompile that item for sure. But we need the dep tracking map to tell
  57 us what *else* we have to recompile. Shared state is anything that is
  58 used to communicate results from one item to another.
  59
  60 ### Identifying the current task
  61
  62 The dep graph always tracks a current task: this is basically the
  63 `DepNode` that the compiler is computing right now. Typically it would
  64 be a procedure node, but it can also be a data node (as noted above,
  65 the two are kind of equivalent).
  66
  67 You set the current task by calling `dep_graph.in_task(node)`. For example:
  68
  69 ```rust
  70 let _task = tcx.dep_graph.in_task(DepNode::Privacy);
  71 ```
  72
  73 Now all the code until `_task` goes out of scope will be considered
  74 part of the "privacy task".
  75
  76 The tasks are maintained in a stack, so it is perfectly fine to nest
  77 one task within another. Because pushing a task is considered to be
  78 computing a value, when you nest a task `N2` inside of a task `N1`, we
  79 automatically add an edge `N2 -> N1` (since `N1` presumably needed the
  80 result of `N2` to complete):
  81
  82 ```rust
  83 let _n1 = tcx.dep_graph.in_task(DepNode::N1);
  84 let _n2 = tcx.dep_graph.in_task(DepNode::N2);
  85 // this will result in an edge N1 -> n2
  86 ```
  87
  88 ### Ignore tasks
  89
  90 Although it is rarely needed, you can also push a special "ignore"
  91 task:
  92
  93 ```rust
  94 let _ignore = tc.dep_graph.in_ignore();
  95 ```
  96
  97 This will cause all read/write edges to be ignored until it goes out
  98 of scope or until something else is pushed. For example, we could
  99 suppress the edge between nested tasks like so:
 100
 101 ```rust
 102 let _n1 = tcx.dep_graph.in_task(DepNode::N1);
 103 let _ignore = tcx.dep_graph.in_ignore();
 104 let _n2 = tcx.dep_graph.in_task(DepNode::N2);
 105 // now no edge is added
 106 ```
 107
 108 ### Tracking reads and writes
 109
 110 We need to identify what shared state is read/written by the current
 111 task as it executes. The most fundamental way of doing that is to invoke
 112 the `read` and `write` methods on `DepGraph`:
 113
 114 ```rust
 115 // Adds an edge from DepNode::Hir(some_def_id) to the current task
 116 tcx.dep_graph.read(DepNode::Hir(some_def_id))
 117
 118 // Adds an edge from the current task to DepNode::ItemSignature(some_def_id)
 119 tcx.dep_graph.write(DepNode::ItemSignature(some_def_id))
 120 ```
 121
 122 However, you should rarely need to invoke those methods directly.
 123 Instead, the idea is to *encapsulate* shared state into some API that
 124 will invoke `read` and `write` automatically. The most common way to
 125 do this is to use a `DepTrackingMap`, described in the next section,
 126 but any sort of abstraction barrier will do. In general, the strategy
 127 is that getting access to information implicitly adds an appropriate
 128 `read`. So, for example, when you use the
 129 `dep_graph::visit_all_items_in_krate` helper method, it will visit
 130 each item `X`, start a task `Foo(X)` for that item, and automatically
 131 add an edge `Hir(X) -> Foo(X)`. This edge is added because the code is
 132 being given access to the HIR node for `X`, and hence it is expected
 133 to read from it. Similarly, reading from the `tcache` map for item `X`
 134 (which is a `DepTrackingMap`, described below) automatically invokes
 135 `dep_graph.read(ItemSignature(X))`.
 136
 137 **Note:** adding `Hir` nodes requires a bit of caution due to the
 138 "inlining" that old trans and constant evaluation still use. See the
 139 section on inlining below.
 140
 141 To make this strategy work, a certain amount of indirection is
 142 required. For example, modules in the HIR do not have direct pointers
 143 to the items that they contain. Rather, they contain node-ids -- one
 144 can then ask the HIR map for the item with a given node-id. This gives
 145 us an opportunity to add an appropriate read edge.
 146
 147 #### Explicit calls to read and write when starting a new subtask
 148
 149 One time when you *may* need to call `read` and `write` directly is
 150 when you push a new task onto the stack, either by calling `in_task`
 151 as shown above or indirectly, such as with the `memoize` pattern
 152 described below. In that case, any data that the task has access to
 153 from the surrounding environment must be explicitly "read". For
 154 example, in `librustc_typeck`, the collection code visits all items
 155 and, among other things, starts a subtask producing its signature
 156 (what follows is simplified pseudocode, of course):
 157
 158 ```rust
 159 fn visit_item(item: &hir::Item) {
 160     // Here, current subtask is "Collect(X)", and an edge Hir(X) -> Collect(X)
 161     // has automatically been added by `visit_all_items_in_krate`.
 162     let sig = signature_of_item(item);
 163 }
 164
 165 fn signature_of_item(item: &hir::Item) {
 166     let def_id = tcx.map.local_def_id(item.id);
 167     let task = tcx.dep_graph.in_task(DepNode::ItemSignature(def_id));
 168     tcx.dep_graph.read(DepNode::Hir(def_id)); // <-- the interesting line
 169     ...
 170 }
 171 ```
 172
 173 Here you can see that, in `signature_of_item`, we started a subtask
 174 corresponding to producing the `ItemSignature`. This subtask will read from
 175 `item` -- but it gained access to `item` implicitly. This means that if it just
 176 reads from `item`, there would be missing edges in the graph:
 177
 178     Hir(X) --+ // added by the explicit call to `read`
 179       |      |
 180       |      +---> ItemSignature(X) -> Collect(X)
 181       |                                 ^
 182       |                                 |
 183       +---------------------------------+ // added by `visit_all_items_in_krate`
 184
 185 In particular, the edge from `Hir(X)` to `ItemSignature(X)` is only
 186 present because we called `read` ourselves when entering the `ItemSignature(X)`
 187 task.
 188
 189 So, the rule of thumb: when entering a new task yourself, register
 190 reads on any shared state that you inherit. (This actually comes up
 191 fairly infrequently though: the main place you need caution is around
 192 memoization.)
 193
 194 #### Dependency tracking map
 195
 196 `DepTrackingMap` is a particularly convenient way to correctly store
 197 shared state. A `DepTrackingMap` is a special hashmap that will add
 198 edges automatically when `get` and `insert` are called. The idea is
 199 that, when you get/insert a value for the key `K`, we will add an edge
 200 from/to the node `DepNode::Variant(K)` (for some variant specific to
 201 the map).
 202
 203 Each `DepTrackingMap` is parameterized by a special type `M` that
 204 implements `DepTrackingMapConfig`; this trait defines the key and value
 205 types of the map, and also defines a fn for converting from the key to
 206 a `DepNode` label. You don't usually have to muck about with this by
 207 hand, there is a macro for creating it. You can see the complete set
 208 of `DepTrackingMap` definitions in `librustc/middle/ty/maps.rs`.
 209
 210 As an example, let's look at the `adt_defs` map. The `adt_defs` map
 211 maps from the def-id of a struct/enum to its `AdtDef`. It is defined
 212 using this macro:
 213
 214 ```rust
 215 dep_map_ty! { AdtDefs: ItemSignature(DefId) -> ty::AdtDefMaster<'tcx> }
 216 //            ~~~~~~~  ~~~~~~~~~~~~~ ~~~~~     ~~~~~~~~~~~~~~~~~~~~~~
 217 //               |           |      Key type       Value type
 218 //               |    DepNode variant
 219 //      Name of map id type
 220 ```
 221
 222 this indicates that a map id type `AdtDefs` will be created. The key
 223 of the map will be a `DefId` and value will be
 224 `ty::AdtDefMaster<'tcx>`. The `DepNode` will be created by
 225 `DepNode::ItemSignature(K)` for a given key.
 226
 227 Once that is done, you can just use the `DepTrackingMap` like any
 228 other map:
 229
 230 ```rust
 231 let mut map: DepTrackingMap<M> = DepTrackingMap::new(dep_graph);
 232 map.insert(key, value); // registers dep_graph.write
 233 map.get(key; // registers dep_graph.read
 234 ```
 235
 236 #### Memoization
 237
 238 One particularly interesting case is memoization. If you have some
 239 shared state that you compute in a memoized fashion, the correct thing
 240 to do is to define a `RefCell<DepTrackingMap>` for it and use the
 241 `memoize` helper:
 242
 243 ```rust
 244 map.memoize(key, || /* compute value */)
 245 ```
 246
 247 This will create a graph that looks like
 248
 249     ... -> MapVariant(key) -> CurrentTask
 250
 251 where `MapVariant` is the `DepNode` variant that the map is associated with,
 252 and `...` are whatever edges the `/* compute value */` closure creates.
 253
 254 In particular, using the memoize helper is much better than writing
 255 the obvious code yourself:
 256
 257 ```
 258 if let Some(result) = map.get(key) {
 259     return result;
 260 }
 261 let value = /* compute value */;
 262 map.insert(key, value);
 263 ```
 264
 265 If you write that code manually, the dependency graph you get will
 266 include artificial edges that are not necessary. For example, imagine that
 267 two tasks, A and B, both invoke the manual memoization code, but A happens
 268 to go first. The resulting graph will be:
 269
 270     ... -> A -> MapVariant(key) -> B
 271     ~~~~~~~~~~~~~~~~~~~~~~~~~~~       // caused by A writing to MapVariant(key)
 272                 ~~~~~~~~~~~~~~~~~~~~  // caused by B reading from MapVariant(key)
 273
 274 This graph is not *wrong*, but it encodes a path from A to B that
 275 should not exist.  In contrast, using the memoized helper, you get:
 276
 277     ... -> MapVariant(key) -> A
 278                  |
 279                  +----------> B
 280
 281 which is much cleaner.
 282
 283 **Be aware though that the closure is executed with `MapVariant(key)`
 284 pushed onto the stack as the current task!** That means that you must
 285 add explicit `read` calls for any shared state that it accesses
 286 implicitly from its environment. See the section on "explicit calls to
 287 read and write when starting a new subtask" above for more details.
 288
 289 ### How to decide where to introduce a new task
 290
 291 Certainly, you need at least one task on the stack: any attempt to
 292 `read` or `write` shared state will panic if there is no current
 293 task. But where does it make sense to introduce subtasks? The basic
 294 rule is that a subtask makes sense for any discrete unit of work you
 295 may want to skip in the future. Adding a subtask separates out the
 296 reads/writes from *that particular subtask* versus the larger
 297 context. An example: you might have a 'meta' task for all of borrow
 298 checking, and then subtasks for borrow checking individual fns.  (Seen
 299 in this light, memoized computations are just a special case where we
 300 may want to avoid redoing the work even within the context of one
 301 compilation.)
 302
 303 The other case where you might want a subtask is to help with refining
 304 the reads/writes for some later bit of work that needs to be memoized.
 305 For example, we create a subtask for type-checking the body of each
 306 fn.  However, in the initial version of incr. comp. at least, we do
 307 not expect to actually *SKIP* type-checking -- we only expect to skip
 308 trans. However, it's still useful to create subtasks for type-checking
 309 individual items, because, otherwise, if a fn sig changes, we won't
 310 know which callers are affected -- in fact, because the graph would be
 311 so coarse, we'd just have to retrans everything, since we can't
 312 distinguish which fns used which fn sigs.
 313
 314 ### Testing the dependency graph
 315
 316 There are various ways to write tests against the dependency graph.
 317 The simplest mechanism are the
 318 `#[rustc_if_this_changed]` and `#[rustc_then_this_would_need]`
 319 annotations. These are used in compile-fail tests to test whether the
 320 expected set of paths exist in the dependency graph. As an example,
 321 see `src/test/compile-fail/dep-graph-caller-callee.rs`.
 322
 323 The idea is that you can annotate a test like:
 324
 325 ```rust
 326 #[rustc_if_this_changed]
 327 fn foo() { }
 328
 329 #[rustc_then_this_would_need(TypeckTables)] //~ ERROR OK
 330 fn bar() { foo(); }
 331
 332 #[rustc_then_this_would_need(TypeckTables)] //~ ERROR no path
 333 fn baz() { }
 334 ```
 335
 336 This will check whether there is a path in the dependency graph from
 337 `Hir(foo)` to `TypeckTables(bar)`. An error is reported for each
 338 `#[rustc_then_this_would_need]` annotation that indicates whether a
 339 path exists. `//~ ERROR` annotations can then be used to test if a
 340 path is found (as demonstrated above).
 341
 342 ### Debugging the dependency graph
 343
 344 #### Dumping the graph
 345
 346 The compiler is also capable of dumping the dependency graph for your
 347 debugging pleasure. To do so, pass the `-Z dump-dep-graph` flag. The
 348 graph will be dumped to `dep_graph.{txt,dot}` in the current
 349 directory.  You can override the filename with the `RUST_DEP_GRAPH`
 350 environment variable.
 351
 352 Frequently, though, the full dep graph is quite overwhelming and not
 353 particularly helpful. Therefore, the compiler also allows you to filter
 354 the graph. You can filter in three ways:
 355
 356 1. All edges originating in a particular set of nodes (usually a single node).
 357 2. All edges reaching a particular set of nodes.
 358 3. All edges that lie between given start and end nodes.
 359
 360 To filter, use the `RUST_DEP_GRAPH_FILTER` environment variable, which should
 361 look like one of the following:
 362
 363 ```
 364 source_filter     // nodes originating from source_filter
 365 -> target_filter  // nodes that can reach target_filter
 366 source_filter -> target_filter // nodes in between source_filter and target_filter
 367 ```
 368
 369 `source_filter` and `target_filter` are a `&`-separated list of strings.
 370 A node is considered to match a filter if all of those strings appear in its
 371 label. So, for example:
 372
 373 ```
 374 RUST_DEP_GRAPH_FILTER='-> TypeckTables'
 375 ```
 376
 377 would select the predecessors of all `TypeckTables` nodes. Usually though you
 378 want the `TypeckTables` node for some particular fn, so you might write:
 379
 380 ```
 381 RUST_DEP_GRAPH_FILTER='-> TypeckTables & bar'
 382 ```
 383
 384 This will select only the `TypeckTables` nodes for fns with `bar` in their name.
 385
 386 Perhaps you are finding that when you change `foo` you need to re-type-check `bar`,
 387 but you don't think you should have to. In that case, you might do:
 388
 389 ```
 390 RUST_DEP_GRAPH_FILTER='Hir&foo -> TypeckTables & bar'
 391 ```
 392
 393 This will dump out all the nodes that lead from `Hir(foo)` to
 394 `TypeckTables(bar)`, from which you can (hopefully) see the source
 395 of the erroneous edge.
 396
 397 #### Tracking down incorrect edges
 398
 399 Sometimes, after you dump the dependency graph, you will find some
 400 path that should not exist, but you will not be quite sure how it came
 401 to be. **When the compiler is built with debug assertions,** it can
 402 help you track that down. Simply set the `RUST_FORBID_DEP_GRAPH_EDGE`
 403 environment variable to a filter. Every edge created in the dep-graph
 404 will be tested against that filter -- if it matches, a `bug!` is
 405 reported, so you can easily see the backtrace (`RUST_BACKTRACE=1`).
 406
 407 The syntax for these filters is the same as described in the previous
 408 section. However, note that this filter is applied to every **edge**
 409 and doesn't handle longer paths in the graph, unlike the previous
 410 section.
 411
 412 Example:
 413
 414 You find that there is a path from the `Hir` of `foo` to the type
 415 check of `bar` and you don't think there should be. You dump the
 416 dep-graph as described in the previous section and open `dep-graph.txt`
 417 to see something like:
 418
 419     Hir(foo) -> Collect(bar)
 420     Collect(bar) -> TypeckTables(bar)
 421
 422 That first edge looks suspicious to you. So you set
 423 `RUST_FORBID_DEP_GRAPH_EDGE` to `Hir&foo -> Collect&bar`, re-run, and
 424 then observe the backtrace. Voila, bug fixed!