Proposal: Provide feedback on push progress. #1262

coollog · 2018-11-20T17:14:15Z

Regards issues: #1251, #806

Project tracker at #1297

Preview at: https://github.com/GoogleContainerTools/jib/blob/proposal-progress-output/proposals/progress_output.md

Brief recording of the current state of the POC: https://asciinema.org/a/H4BIcjEYsSa8Rtyy0ayZt01Xh

chanseokoh · 2018-11-20T19:00:35Z

proposals/progress_output.md

+
+### Alternative considerations
+
+- display byte-completion (eg. `10MB/100MB`)


I like this. Some layers are tiny and others huge. You get the feeling of how much it will take when you know the entire size.

What do you think about including time elapsed and/or a progress bar as well?

Adding time sounds fine, but I think the process bar won't help much unless you are doing in-place ncurses-like line replacement. I think we can start simple.

Don't we need to do in-place replacement anyway if we're showing byte completion?

It'll be just like the percentage completion except rather than a percent, it will show the absolute byte counts.

Elapsed time and throughput (average and instantaneous) are helpful when diagnosing sloooow connections.

Let's take a poll:

Place your name after the option(s) you would like to vote for

byte completion | votes: chanseokoh, TadCordle

time elapsed | votes: chanseokoh, TadCordle

progress bar | votes:

throughput | votes: chanseokoh (average only)

loosebazooka · 2018-11-20T19:27:21Z

In Jib core this will be implemented as events?
We don't have to use gradle/maven logging directly. We might be able to use system.out and mess with \r conditionally based on the log level

if loglevel.atleast INFO {
  print "progress\r"
}

We could then potentially do a single progress bar of some style for example:

Executing 6/10 tasks [========================         ] 80% complete

coollog · 2018-11-20T20:17:06Z

Yes, we should have the progress be emitted via progress events.
Not sure how we would be able to find which line to update in this case since different lines may print after other lines return to the beginning?
So as in we would perform synchronization among the tasks? I feel like we should avoid that since it might unnecessarily add a bit to build times and increase complexity.

loosebazooka · 2018-11-20T20:21:10Z

Yeah I'm not sure how clean this would be, but ending a line with \r should just move the cursor so any new log message will overwrite it and our next progress will just be on the latest line.
Would not be synchronization of the tasks, but smarter handling of the progress on the plugin side.

chanseokoh · 2018-11-20T20:23:30Z

So as in we would perform synchronization among the tasks? I feel like we should avoid that since it might unnecessarily add a bit to build times and increase complexity.

One way is to do something our internal build tool does: start with known total bytes to process, and as we discover more bytes to process, re-scale the whole progress workload. For example, at first it might look "10MB/100MB", but as you start processing more layers, it may become "11MB/254MB", "12MB/318MB", and so on.

loosebazooka · 2018-11-20T20:25:10Z

Oh yeah, sorry, the progress bar would be variable as @chanseokoh describes, moving around like a drunken windows progress bar (in a good way)

coollog · 2018-11-20T20:35:04Z

Yea, a variable progress bar would work, though it would still mean that there would have to be synchronization of some running total across the tasks though, whereas currently we don't have any explicit shared mutable memory between the concurrent tasks.

loosebazooka · 2018-11-20T20:36:00Z

They all just send their progress over to the progress handler? And it does the math? They don't have to know about eachother.

coollog · 2018-11-20T20:39:10Z

The tasks won't need to know about each other, but the progress handler would need to have synchronized memory to handle the concurrent mutations of its progress/total counters, which would lock the tasks against each other during progress updates.

loosebazooka · 2018-11-20T20:51:42Z

Oh I see, so the event manager isn't dispatching events on a single thread? A UI Thread, like swing?

coollog · 2018-11-20T20:57:01Z

The event manager dispatches to a method call on the thread it is dispatched from, although that method could be implemented to execute logic on another thread if needed. However, we probably should avoid having another thread purely for progress monitoring since that could incur a significant cost in context switching for all the progress updates.

chanseokoh · 2018-11-20T21:11:29Z

For updating a global counter, I think AtomicInteger.getAndAdd() may work, which atomically adds a number in a lock-free and thread-safe manner. No explicit synchronization is required.

From the Javadoc,

The specifications of these methods enable implementations to employ efficient machine-level atomic instructions that are available on contemporary processors. However on some platforms, support may entail some form of internal locking. Thus the methods are not strictly guaranteed to be non-blocking -- a thread may block transiently before performing the operation.

I think nowadays with most users using contemporary processors, it will work efficiently without blocking. Even if it blocks, it should be transient (which I don't think will actually happen in practice on modern machines). And given the very low update frequency (10-second interval or so?), the chance of contention seems extremely low.

chanseokoh · 2018-11-20T21:14:28Z

Oh I see, so the event manager isn't dispatching events on a single thread? A UI Thread, like swing?

AFAIK, Gradle uses a separate dedicated thread (via Executors.newSingleThreadExecutor()) to print logs, while on the Maven side, logs are printed by the thread firing the log event.

coollog · 2018-11-20T22:08:13Z

There's also the issue that we would not be able to log anything else after the progress bar starts and before it finishes, so this would not be compatible with other messages like retrying with auth token. The original "Building..." messages would all be replaced too.

chanseokoh · 2018-11-20T22:31:37Z

Yeah, so I'm actually skeptical with the \r approach. (On the other hand, the single, uber progress "bar" can still work without \r.)

loosebazooka · 2018-11-20T22:38:24Z

Yeah using \r is susceptible to corruption, but the idea is basically something like this

#!/bin/bash

printf "progress [----    ]\r"
sleep 1
printf "some log statement here\n"
printf "progress [----   ]\r"
sleep 1
printf "some other log statment\n"
printf "progress [----   ]\r"

TadCordle · 2018-11-26T16:34:03Z

I think I'm with @loosebazooka on this one, \r may be susceptible to corruption but I think we can make it work if we're careful. It may be a little extra work on our end but I think it ends up being a better user experience than just printing more log messages.

coollog · 2018-11-29T21:08:36Z

Updated with design towards the progress bar approach. See Proposal section. If looks good, I will start implementing the DAT-based progress events first.

chanseokoh · 2018-11-29T21:54:40Z

proposals/progress_output.md

+
+These issues can be resolved with a *decentralized allocation tree*.
+
+#### Decentralized allocation tree (DAT)


I read through it, and still not clear if this will work as intended, if what I understood is correct, which makes me think probably I'm missing something.

So, what I can only think of for the intention of a tree is that, basically, the root node corresponds to the top-level entity that kicks off multiple second-level async "steps". For example, if you look at BuildSteps.forBuildToDockerRegistry, it spawns 12 steps:

public static BuildSteps forBuildToDockerRegistry(BuildConfiguration buildConfiguration) { return new BuildSteps( DESCRIPTION_FOR_DOCKER_REGISTRY, buildConfiguration, () -> new StepsRunner(buildConfiguration) .runRetrieveTargetRegistryCredentialsStep() .runAuthenticatePushStep() .runPullBaseImageStep() .runPullAndCacheBaseImageLayersStep() .runPushBaseImageLayersStep() .runBuildAndCacheApplicationLayerSteps() .runBuildImageStep() .runPushContainerConfigurationStep() .runPushApplicationLayersStep() .runFinalizingPushStep() .runPushImageStep() .waitOnPushImageStep()); }

So you intend that you create a root node here with 12 allocation units, pass the root node to the 12 Step instances, and each step will create a single child node (with some allocation units) soon, resulting in 13 nodes (1 root + 12 children) at some point? And each Step further passe down its child node to methods and instances it own in the similar manner?

Yes, that is how it is intended to work. The only thing that will need to known prior to an allocation node being created is the number of allocation units that node should have.

Oh one thing to note is that the root node with 12 allocation units does not necessarily need to have 12 children - allocation units can be "occupied" by non-node-based updates (like received bytes in the download progress case).

If so (passing down the root node to multiple async step instances), and if the root and the children are physically connected (I assume so, because otherwise, it isn't a tree at all), isn't it that you are sharing the tree (at least the root node) concurrently? It sounds like basically you are sharing a single progress state. I thought it'd be cool if anyone could just fire progress events in an isolated manner, not having to establish any connection in relation to others. With no communications with others, it may be impossible to avoid the "backward-moving" progress bar, but I think it's worth the sacrifice for simplicity.

The node(s) are passed down, but they are all immutable with no progress state. The state is only kept at the receiving end (the progress monitor that receives the progress events). The DAT is essentially just a representation of what fraction of the total progress a progress event actually accounts for. This allows for progress events to be emitted asynchronously and in a decentralized manner, only to be collected by the progress monitor and summed into an overall current progress.

briandealwis · 2018-11-30T02:57:59Z

https://github.com/GoogleCloudPlatform/appengine-plugins-core/blob/master/src/main/java/com/google/cloud/tools/managedcloudsdk/ChildProgressListener.java

TadCordle · 2018-12-03T15:49:26Z

proposals/progress_output.md

+### Example
+
+```
+Executing tasks: Pushing classes layer, pulling base image layer 50501d3b88f7, pushing dependencies layer, pushing base image layer 8b106a18283f


It may end up being difficult reading these log messages if we put them on one line/constantly update them; it seems like this layout would be great for serial operations, but when we have a bunch of steps executing at once, showing the log messages like this may not add a lot of value.

Update: working on prototyping having these on multiple lines - it makes things look a lot clearer

TadCordle · 2018-12-03T15:50:12Z

proposals/progress_output.md

+
+## Proposal
+
+Display an overall progress bar along with the tasks currently being executed.


How will this work at different logging levels?

I think we can disable the progress bar for logging levels debug and higher since there will be many log messages to indicate progress anyways, but the progress bar itself should be implemented such that it does not interfere with the other log messages Jib emits.

coollog · 2018-12-03T22:35:10Z

A recording of the WIP POC: https://asciinema.org/a/H4BIcjEYsSa8Rtyy0ayZt01Xh

There are still a few artifacts to clean up, most notably Gradle's own progress logs being left over. Also, the old log messages would be disabled since they don't add anything of value anymore.

TadCordle

The prototype looks good to me, I think you just need to update the proposal with your findings from prototyping.

coollog · 2018-12-04T22:05:21Z

Updated the example and made a note about the things that need to be handled for Gradle logging.

proposals/progress_output.md

Proposal: Provide feedback on push progress.

1eb1942

coollog added the proposal label Nov 20, 2018

coollog added this to the v0.10.1 milestone Nov 20, 2018

coollog requested a review from a team November 20, 2018 17:14

googlebot added the cla: yes label Nov 20, 2018

coollog added 2 commits November 20, 2018 12:14

Fixes issue links.

cfd4fbd

Improves Motivation.

72a933e

chanseokoh reviewed Nov 20, 2018

View reviewed changes

Updates to incorporate design for progress bar instead.

6008ece

coollog requested a review from a team November 29, 2018 21:09

Adds note.

d0e5b21

chanseokoh reviewed Nov 29, 2018

View reviewed changes

TadCordle reviewed Dec 3, 2018

View reviewed changes

TadCordle requested a review from a team December 3, 2018 15:59

coollog mentioned this pull request Dec 4, 2018

Provide feedback on containerization progress. #1297

Closed

28 tasks

TadCordle approved these changes Dec 4, 2018

View reviewed changes

Updates according to new findings.

881ebec

chanseokoh reviewed Dec 5, 2018

View reviewed changes

proposals/progress_output.md Outdated Show resolved Hide resolved

Fix sub percentage.

5eaad1f

coollog modified the milestones: v0.10.1, v1.0.0 Dec 5, 2018

coollog requested a review from a team December 5, 2018 17:38

coollog merged commit 043ec64 into master Dec 5, 2018

coollog deleted the proposal-progress-output branch December 5, 2018 19:37


		### Alternative considerations

		- display byte-completion (eg. `10MB/100MB`)


		These issues can be resolved with a decentralized allocation tree.

		#### Decentralized allocation tree (DAT)


		## Proposal

		Display an overall progress bar along with the tasks currently being executed.

Proposal: Provide feedback on push progress. #1262

Proposal: Provide feedback on push progress. #1262

Conversation

coollog commented Nov 20, 2018 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

coollog Nov 20, 2018 • edited by TadCordle Loading

Choose a reason for hiding this comment

loosebazooka commented Nov 20, 2018

coollog commented Nov 20, 2018

loosebazooka commented Nov 20, 2018

chanseokoh commented Nov 20, 2018

loosebazooka commented Nov 20, 2018 • edited Loading

coollog commented Nov 20, 2018

loosebazooka commented Nov 20, 2018

coollog commented Nov 20, 2018

loosebazooka commented Nov 20, 2018

coollog commented Nov 20, 2018

chanseokoh commented Nov 20, 2018 • edited Loading

chanseokoh commented Nov 20, 2018

coollog commented Nov 20, 2018

chanseokoh commented Nov 20, 2018

loosebazooka commented Nov 20, 2018

TadCordle commented Nov 26, 2018

coollog commented Nov 29, 2018

chanseokoh Nov 29, 2018 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

coollog Nov 29, 2018 • edited Loading

Choose a reason for hiding this comment

chanseokoh Nov 29, 2018 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

briandealwis commented Nov 30, 2018

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

coollog commented Dec 3, 2018 • edited Loading

TadCordle left a comment

Choose a reason for hiding this comment

coollog commented Dec 4, 2018

coollog commented Nov 20, 2018 •

edited

Loading

coollog Nov 20, 2018 •

edited by TadCordle

Loading

loosebazooka commented Nov 20, 2018 •

edited

Loading

chanseokoh commented Nov 20, 2018 •

edited

Loading

chanseokoh Nov 29, 2018 •

edited

Loading

coollog Nov 29, 2018 •

edited

Loading

chanseokoh Nov 29, 2018 •

edited

Loading

coollog commented Dec 3, 2018 •

edited

Loading