Spawn callback #2727

rkdarst · 2019-09-09T16:01:18Z

At the JupyterHub/BinderHub workshop, @minrk and I discussed adding the spawn callback from batchspawner to JupyterHub itself. Quote from batchspawner issue:

I talked to @minrk and he thought that it would be reasonable to put the the callback handler into the main hub and expect every spawner to hit it once (even if they don't send any useful information back). He thought that there would be other benefits, too: right now, after the spawner returns the URL, the hub polls it until the single-user HTTP server is reachable. This would avoid that polling and allow faster response when starting. Then, spawners can return the URL (or port... but whole URL will be the most general) if it wasn't known at the time of Spawner.start() returning. The payload from the callback can be sent to a spawner hook to process it and do what is needed.

This is a non-working pull request that just has the template of the code that Min showed me, in case anyone else wants to take a look before I get to it (feel free!). It hasn't even been tested and I haven't tried to make the pieces actually work yet.

The issues we know of so far (may be edited):

Some spawners don't know their own address (but this is still useful - reduces polling for startup).
The spawner has to be able to tell the single-user server what information should be sent back. Spawners are not necessarily installed in single-user environments and we probably don't want to require them to be all the time.
Should the spawner be able to indicate that the hub should/shouldn't wait for the
What should the default handler do?

Copied here from jupyterhub/batchspawner#146

minrk

Excellent, thanks for opening this!

I think the only trick right now is to figure out how and if Spawners customize what goes in the payload. Notebook server extensions are one option, but that becomes tricky very fast. If we can support a baseline of information (bind url, hostname, pid), hopefully most cases can be covered. I see in BatchSpawner that only port is passed along. I would think that hostname would also be useful, perhaps replacing state_gethost?

I'll poke around here to see if we can come up with something.

minrk · 2019-09-24T10:10:07Z

jupyterhub/spawner.py

+            self.hub.api_url,
+            'users',
+            # tolerate mocks defining only user.name
+            getattr(self.user, 'escaped_name', self.user.name),


no need for getattr check here, since this is in the same codebase that defines escaped_name

The code right above (JUPYTERHUB_ACTIVITY_URL) has this too with the same message (mocks defining only user.name)... if that needs it then this would, too. What should be done?

minrk · 2019-09-24T10:12:03Z

jupyterhub/singleuser.py

+        async def notify():
+            self.log.debug("Notifying Hub of readyness via spawn_callback")
+            req = HTTPRequest(
+                url=self.hub_activity_url,


spawn_callback_url?

And only call it if spawn_callback_url is set, which effectively implements single-user -> hub backward compatiblity (not the other way around)

jupyterhub/singleuser.py

minrk · 2019-09-24T10:13:01Z

jupyterhub/singleuser.py

+                body=json.dumps({}),
+            )
+            try:
+                await client.post(req)


I think we use client.fetch here and method='POST' above in the Request

rkdarst · 2019-09-24T11:02:01Z

I think the only trick right now is to figure out how and if Spawners customize what goes in the payload. Notebook server extensions are one option, but that becomes tricky very fast. If we can support a baseline of information (bind url, hostname, pid), hopefully most cases can be covered. I see in BatchSpawner that only port is passed along. I would think that hostname would also be useful, perhaps replacing state_gethost?

That's roughly what I was thinking, then it's up to the spawner handler to decide which is actually useful, how it should actually be interpreted, and if any is . Then it's a balance between adding extra things to the jupyterhub singleuser-server or having spawners add extensions. But we can deal with this later. Things I can think of to collect: port, hostname, raw IP(s), pid. Maybe the contents of a JUPYTERHUB_SINGLEUSER_BIND_URL environment variable... that would allow spawner start scripts to send arbitrary data without making full-scale extensions.

Spawner authors, what would be useful?

rkdarst · 2019-09-24T14:37:49Z

I'm getting this problem in tests:

[W 53:32.933 MockHub web:1782] 400 POST /@/space%20word/hub/api/users/admin/servers/spawn_callback (127.0.0.1): Named servers are not enabled.
[W 53:32.934 MockHub log:174] 400 POST /@/space%20word/hub/api/users/admin/servers/spawn_callback (admin@127.0.0.1) 9.43ms

so it thinks that /spawn_callback is the server name. Handlers are defined this way:

    (r"/api/users/([^/]+)/servers/([^/]*)", UserServerAPIHandler),
    (r"/api/users/([^/]+)/servers/([^/]*)/progress", SpawnProgressAPIHandler),
    (r"/api/users/([^/]+)/servers/([^/]*)/spawn_callback", SpawnCallbackAPIHandler),

If progress works, then spawn_callback should as far as I could tell. Even reordering the handlers so that the spawn_callback one is first doesn't seem to help. I've banged on this long enough I thought would ask for other eyes. My first thought is something trivial like a small misspelling, but I can't find it. Also ,the progress URL seems to be called with only one slash successfully...

On a side note... if there's a named server called "progress" or "spawn_callback", does stuff start to break?

Hoeze · 2020-03-29T16:35:38Z

@rkdarst Is this functional?

rkdarst · 2020-03-29T21:20:31Z

No, not yet. If anyone wants to take a look, I can make sure all my work is pushed so far (but from what I can tell, it mostly is except for some quick debugging stuff).

Hoeze · 2020-04-05T01:16:14Z

Hey, I tried to get this running but the only thing I accomplished until now is to have tests passing with the current Jupyterhub master:
https://github.com/Hoeze/jupyterhub

How do I get a good testing setup? At the moment I just run:

export JUPYTERHUB_API_URL=http://127.0.0.1:8081/hub/api
export JUPYTERHUB_CLIENT_ID=jupyterhub-user-<username>-asdf
export JUPYTERHUB_OAUTH_CALLBACK_URL=/user/<username>/asdf/oauth_callback
export JUPYTERHUB_SERVER_NAME=asdf
export JUPYTERHUB_SERVICE_PREFIX=/user/<username>/asdf/
export JUPYTERHUB_USER=<username>
jupyterhub-singleuser --port 8234

What is missing to get this working?

manics · 2022-05-21T14:28:36Z

I've marked this as draft

rkdarst mentioned this pull request Sep 9, 2019

"Remote port selection" allows redesigning batchspawner to avoid polling/spawner interaction jupyterhub/batchspawner#146

Open

minrk reviewed Sep 24, 2019

View reviewed changes

rkdarst force-pushed the spawn-callback branch from 8184ac8 to 98b2c92 Compare September 24, 2019 12:46

rkdarst force-pushed the spawn-callback branch from 98b2c92 to e4d9d6d Compare December 28, 2019 23:58

This was referenced Mar 29, 2020

Change worker spawn logic: Implement singleuser command as Jupyterhub client #2986

Closed

Batchspawner spawning / keep-alive is instable jupyterhub/batchspawner#174

Closed

rkdarst added 2 commits April 18, 2020 16:30

Spawn callback, template code (non-functional)

8a39e95

Add debugging

e957aa6

rkdarst force-pushed the spawn-callback branch from e4d9d6d to e957aa6 Compare April 18, 2020 13:30

minrk mentioned this pull request Jun 2, 2020

utils.random_port function makes wrong assumptions #3005

Open

consideRatio added the enhancement label Oct 15, 2020

rkdarst mentioned this pull request Oct 16, 2020

[Feature] Let singleuser server select a free random port to listen on jupyterhub/kubespawner#448

Draft

mbmilligan mentioned this pull request Feb 17, 2021

Select single user server port on remote host #1830

Open

minrk mentioned this pull request May 4, 2021

Stop specifying --ip and --port on the command-line #3381

Merged

manics marked this pull request as draft May 21, 2022 14:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Spawn callback #2727

Spawn callback #2727

rkdarst commented Sep 9, 2019

minrk left a comment

minrk Sep 24, 2019

rkdarst Sep 24, 2019

minrk Sep 24, 2019

minrk Sep 24, 2019

minrk Sep 24, 2019

rkdarst commented Sep 24, 2019

rkdarst commented Sep 24, 2019

Hoeze commented Mar 29, 2020

rkdarst commented Mar 29, 2020 via email

Hoeze commented Apr 5, 2020

manics commented May 21, 2022

Spawn callback #2727

Are you sure you want to change the base?

Spawn callback #2727

Conversation

rkdarst commented Sep 9, 2019

minrk left a comment

Choose a reason for hiding this comment

minrk Sep 24, 2019

Choose a reason for hiding this comment

rkdarst Sep 24, 2019

Choose a reason for hiding this comment

minrk Sep 24, 2019

Choose a reason for hiding this comment

minrk Sep 24, 2019

Choose a reason for hiding this comment

minrk Sep 24, 2019

Choose a reason for hiding this comment

rkdarst commented Sep 24, 2019

rkdarst commented Sep 24, 2019

Hoeze commented Mar 29, 2020

rkdarst commented Mar 29, 2020 via email

Hoeze commented Apr 5, 2020

manics commented May 21, 2022