
Optimized compiler and linker processing of changes to literals and source code formatting #4907

Open
tarsa opened this issue Sep 24, 2023 · 8 comments · May be fixed by #4927
Labels
optimization Optimization only. Does not affect semantics or correctness.

Comments

@tarsa

tarsa commented Sep 24, 2023

Is it feasible to add special-case compiler (and linker) support for very quick processing of changes mentioned in initial posts of Incremental linker #1626, i.e. changes to literals and source code formatting?

#1626 (comment)

ramnivas, April 1, 2015:
The change I made was a trivial one ("foo" -> "fooo")
japgolly, April 1, 2015:
Me too. This is my output from after just adding a newline to a file:

#1626 (comment)

easel
That's a one-character change to a string constant

Such a mechanism would allow for some impressive demos (probably important for making a good impression on newcomers and commentators), but otherwise wouldn't be universally useful.

Originally posted by @tarsa in #1626 (comment)

@tarsa
Author

tarsa commented Sep 24, 2023

I've not done any substantial metaprogramming, but (most probably, to keep things reasonable) if the changed literals or source code positions are used in macros (or any sort of metaprogramming), especially in conditions, then the compiler would need to fall back to the regular compilation process.
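A hypothetical illustration of the concern (not code from the thread): a decision a macro might make at expansion time, written here as a plain function. Changing the literal "foo" to "fooo" flips the branch, so the generated code would differ and the optimized fast path would have to fall back to full compilation.

```scala
// Hypothetical: stands in for a compile-time decision a macro could make
// based on a string literal. A "trivial" literal edit changes the result,
// so literal changes feeding metaprogramming cannot take the fast path.
def pickEncoding(label: String): String =
  if (label.length > 3) "wide" else "compact"
```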

@sjrd
Member

sjrd commented Sep 25, 2023

Also copying @gzm0's answer from #1626 (comment) :

Is it feasible to add special-case compiler (and linker) support for very quick processing of changes mentioned in initial posts, i.e. changes to literals and source code formatting?

This should be possible, yes. Essentially, what we could do (in Scala.js linker internal parlance) is re-use the previous Analysis if none of the Infos changed.

However, I'm not sure we can do this without incurring additional cost on linking, so we need to weigh this trade-off carefully.

If you feel we should investigate this, may I suggest you open another issue to discuss this (IMHO the scope is quite different)?
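The idea in gzm0's answer can be modeled in miniature: cache the last Analysis together with the Infos it was computed from, and return it unchanged when the freshly loaded Infos are identical (as they would be after a literal or formatting-only edit). The types below (Info, Analysis, AnalysisCache) are simplified stand-ins, not the linker's real classes:

```scala
// Simplified sketch, not the Scala.js linker API: reuse the previous
// Analysis when no class Infos changed between runs.
final case class Info(className: String, version: Int)
final case class Analysis(classInfos: Map[String, Info])

final class AnalysisCache {
  private var previous: Option[(Map[String, Info], Analysis)] = None

  /** Run `analyze` only if the loaded infos differ from the last run. */
  def computeReachability(infos: Map[String, Info])(
      analyze: Map[String, Info] => Analysis): Analysis = {
    previous match {
      case Some((prevInfos, prevAnalysis)) if prevInfos == infos =>
        prevAnalysis // infos identical: skip the analyzer run entirely
      case _ =>
        val result = analyze(infos)
        previous = Some((infos, result))
        result
    }
  }
}
```

The open question in the thread is exactly the cost of the `prevInfos == infos` comparison (loading and comparing every Info) on runs where something did change.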

@sjrd
Member

sjrd commented Sep 25, 2023

TBH I don't think it makes sense to implement this. The crux of the matter is:

However, I'm not sure we can do this without incurring additional cost on linking, so we need to weigh this trade-off carefully.

Do we really want to spend additional time on every linker run so that we can save time on the (likely vanishingly) small number of runs where Infos don't change at all? What good are "impressive demos" if we make the actual use cases worse?

@tarsa
Author

tarsa commented Sep 30, 2023

Well, it depends on the overhead (in the non-optimizable case), then. If it's non-negligible but reasonably small, then perhaps a survey would help decide whether to keep it?

Personally, I don't have a strong opinion on this optimization. I don't feel I need it myself, but I've thought it could make Scala.js more attractive to programmers, especially newcomers. If you think otherwise, then this ticket could be closed.

@gzm0
Contributor

gzm0 commented Oct 7, 2023

I've been thinking about this: I'll start experimenting with eagerly re-loading all previously loaded infos in a subsequent analyzer run. This is probably a good heuristic. If this doesn't hurt performance (for all we know, it might even improve it, since it flattens the parallelism graph), we might have a case to build this.

@gzm0
Contributor

gzm0 commented Oct 7, 2023

Seems like preloading hurts performance (even if nothing changed). I suspect lock contention.

IIUC the only additional work we introduce with the change below (in a "nothing changed" incremental compile) is the outermost loop and some additional GC pressure.

[attachments: plot, logger-timings.csv]

--- a/linker/shared/src/main/scala/org/scalajs/linker/analyzer/Analyzer.scala
+++ b/linker/shared/src/main/scala/org/scalajs/linker/analyzer/Analyzer.scala
@@ -53,11 +53,18 @@ final class Analyzer(config: CommonPhaseConfig, initial: Boolean,
     )
   }
 
+  private var previousAnalysis: Analysis = _
+
   def computeReachability(moduleInitializers: Seq[ModuleInitializer],
       symbolRequirements: SymbolRequirement, logger: Logger)(implicit ec: ExecutionContext): Future[Analysis] = {
 
     infoLoader.update(logger)
 
+    if (previousAnalysis != null) {
+      for (className <- previousAnalysis.classInfos.keys)
+        infoLoader.loadInfo(className)
+    }
+
     val run = new AnalyzerRun(config, initial, infoLoader)(
         adjustExecutionContextForParallelism(ec, config.parallel))
 
@@ -67,6 +74,10 @@ final class Analyzer(config: CommonPhaseConfig, initial: Boolean,
         if (failOnError && run.errors.nonEmpty)
           reportErrors(run.errors, logger)
 
+        // TODO: Make it depend on the real future.
+        // Don't store if it has errors.
+        previousAnalysis = run
+
         run
       }
       .andThen { case _ => infoLoader.cleanAfterRun() }

@gzm0
Contributor

gzm0 commented Oct 7, 2023

> d %>%
  filter(grepl('Compute reachability', op)) %>%
  group_by(variant, op) %>%
  summarise(t_ns = median(t_ns))
`summarise()` has grouped output by 'variant'. You can override using the `.groups` argument.
# A tibble: 4 × 3
# Groups:   variant [2]
  variant op                                  t_ns
  <fct>   <fct>                              <dbl>
1 main    Linker: Compute reachability  148486462.
2 main    Refiner: Compute reachability 139398791 
3 preload Linker: Compute reachability  181934764 
4 preload Refiner: Compute reachability 150051568 

Seems we're losing about 30ms on the first pass and 10ms on the refiner.

@gzm0
Contributor

gzm0 commented Oct 7, 2023

I might have spoken too soon: Parallelizing the pre-loading right away gives comparable performance:

    if (previousAnalysis != null) {
      Future.traverse(previousAnalysis.classInfos) { case (className, prevInfo) =>
        infoLoader.loadInfo(className) match {
          case None      => Future.successful(false)
          case Some(fut) => fut.map(_ == prevInfo.data)
        }
      }
    }
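One detail the snippet above leaves open: the result of `Future.traverse` (a future of per-class equality booleans) is discarded. A hypothetical helper (not the linker's API) could fold those comparison futures into a single "no Info changed" signal that gates reuse of the previous analysis:

```scala
import scala.concurrent.{Await, Future}
import scala.concurrent.duration._
import scala.concurrent.ExecutionContext.Implicits.global

// Hypothetical helper: combine per-class comparison futures into one
// "nothing changed" result. A caller could reuse the previous Analysis
// only when this yields true.
def nothingChanged(checks: Seq[Future[Boolean]]): Future[Boolean] =
  Future.sequence(checks).map(_.forall(identity))

// Example: one class compares equal, another does not, so the previous
// analysis would not be reusable and a full run is needed.
val reusable = Await.result(
  nothingChanged(Seq(Future.successful(true), Future.successful(false))),
  1.second)
```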

[attachments: plot, logger-timings.csv]

@gzm0 gzm0 added the optimization Optimization only. Does not affect semantics or correctness. label Jan 2, 2024