Moved uspace field from thread_t to proc_t #289

rafalcieslak · 2017-05-02T14:37:14Z

Moving td_uspace to p_uspace turned out to be more complicated than I expected it to be. This is because we've been using the td_uspace field in two ways:

To store the userspace map of a particular user-mode thread
To keep a temporary userspace map that is different from 1. when we need to do some operations on it and need to be sure it will be re-activated when the current thread continues after preemption.

If we want the uspace to be a property of a process and shared between multiple threads, than we can no longer use it for 2, it wouldn't make any sense. Therefore I had to significantly modify the semantics of vm_map_activate. Until now, it would activate a vm_map and assign it to the current thread. As threads no longer have each their own vm_map reference, the former is no longer possible. So now all vm_map_activate does is activating the vm_map for use. This, in turn, means that we need to make sure to disable preemption while we're using an active vm_map that is not native to the current thread/process. Therefore I've been forced to add extra critical_sections to vm_map_clone and do_exec, as both these temporarily activate a different userspace vm_map in order to prepare it.

I've also had to completely disable the sched test - as threads no longer carry a uspace, the test is pointless. It was already marked as KTEST_FLAG_BROKEN, so it's not like we've been using it a lot.

cahirwpz · 2017-05-02T16:59:45Z

sys/vm_map.c

 #include <mips/mips.h>

 static vm_map_t kspace;
+static vm_map_t *uspace = NULL;


There's better place to keep it – please move it to pcpu structure.

rafalcieslak · 2017-05-02T17:24:37Z

mips/switch.S

@@ -39,7 +39,7 @@ ctx_resume:
        LOAD_PCPU($t0)
        sw      $s0, PCPU_CURTHREAD($t0)

-        lw      $a0, TD_USPACE($s0)
+        lw      $a0, PCPU_USPACE($t0)


I'm not sure that works at all? Instead of reading which vm_map this ~~thread~~ process uses, you just activate the one that is currently active. That doesn't change the current map at all, right? Or do I miss something?

Oops... you're right. It's a shame I introduced a bug with this change -_-

cahirwpz · 2017-05-03T09:49:45Z

As you know critical_* mechanism is not a generic way to synchronize threads. It should be limited to the implementation of basic synchronization primitives and some parts of scheduler. Every time we use critical_* outside of this scope shows a design flaw in our kernel, and wrapping significant parts of do_exec and vm_map_clone makes the flaw apparent to full extent.

We have to find an alternative way to synchronize the parts of kernel that needlessly use critical_* primitive. Firstly we should identify minimal guarantees that are needed to perform the task and then choose right solution. Perhaps it's time to revisit #242. I'll reread spl(9) and kpreempt(9). Your suggestions are welcome!

rafalcieslak · 2017-05-03T13:58:41Z

Yes, I've only used critical_* due to the lack of any better alternative. But I suppose we don't necessarily need better synchronization features for this particular PR.

First thing is that I wouldn't worry much about vm_map_clone - the entire function is a temporary hack which we hope to remove soon once we have better memory management features available.

do_exec is different - we care about its implementation. The nature of the exec system call is such that on successful exec, all other threads of the calling process are terminated (and it's probably rare for an exec to get called by a multi-threaded context). We could hypothetically terminate all other threads before preparing the new vm_map. That would be convenient, once other threads are terminated, we can safely assign the vmap to p_uspace, and only then prepare the vmap - this way we'll be safe if we get preempted, as no other threads would share the vmap yet. However, if exec happens to fail, all other threads must stay alive (or do they? I can't find specific info on that anywhere, but it sounds logical not to terminate them if exec would not succeed.) - so before we terminate them we need to be sure that no further errors may happen. So for this to work, we would need to validate the entire ELF file first, then kill other threads, then assign a new vmap, then setup the vmap according to the ELF file. If that sounds okay to you, I can try re-implement exec to use this two-stage setup, and we'll never need a critical section. OTOH, if we want the other threads alive while we prepare new process memory map, then the minimal guarantee we need for this to work is that no other threads of the same process can continue while we're setting up vmap.

Side note: I haven't thought about it in greater detail, but what if a thread that activates a particular uspace would lock a recursive mutex on that vm_map structure? The lock would be released/acquired in the vm_map_activate function. When editing a vm_map, lock the mutex recursively once more. This way no other threads will be able to activate a vm_map while it's being edited by another thread, and no thread will be able to edit a vm_map while it's in use by another thread (e.g. on another processor). It sounds almost to simple to be true...?

cahirwpz · 2017-05-04T09:12:24Z

Multithreading doesn't play well with unix process model. Handling properly fork in multithreaded application is problematic. POSIX.1 specification for exec* doesn't say exactly what to do when exec call fails:

A call to any exec function from a process with more than one thread shall result in all threads being terminated and the new executable image being loaded and executed.

A programmer that calls fork or exec in multithreaded application does not understand consequences of running multiple threads very well. IMO in most cases introduction of threads is driven by bad design decisions.

Let's ignore weird corner cases and support the most common scenario well , which is fork + exec. When exec fails a programmer is supposed to terminate the process.

rafalcieslak · 2017-05-04T11:10:54Z

I've managed to confirm that Linux destroys all threads on a call to execve, regardless whether the call was successful or not - and that is also a good indication that it's okay if we do the same. I'll update this branch soon.

cahirwpz · 2017-05-11T09:57:59Z

sys/vm_map.c

-     it's okay if we get preempted - the working vm map will be restored on
-     context switch. */
+  /* Temporarily switch to the new map, so that we may write contents. */
+  critical_enter();


Is really critical_enter() required here? Can we make weaker assumptions?

Perhaps thread_self()->proc->p_nthreads == 1 assertion is just enough to perform this step correctly?

We'd need to check if there is only one thread in this process AND set proc->p_uspace = newmap for the duration of the copy process.

EDIT: Also, this would only work if clone is called by a user thread. Kernel threads have their userspace map reset to NULL on each context switch.

Ok. So the minimum assumption we must take is that we enter vm_map_clone with:

user-space thread (i.e. withp_uspace in place),

the only thread in a process – thus we know that no other thread (within the same process) will be started in the meantime.

If that is correct, please drop critical section and enforce above assumptions with assert.

rafalcieslak added 2 commits May 2, 2017 15:49

Moved uspace field from thread_t to proc_t

52c9725

Merge branch 'master' into proc_struct2

2f3823e

cahirwpz reviewed May 2, 2017

View reviewed changes

rafalcieslak and others added 3 commits May 2, 2017 19:09

Moved uspace vm_map ptr to pcpu struct

0dbe37c

Simplify code a bit.

185d07a

Remove unused TD_PROC

3f1bea7

rafalcieslak commented May 2, 2017

View reviewed changes

cahirwpz and others added 3 commits May 2, 2017 20:30

Fix a bug I introduced with 3f1bea7

50e8d4c

Merge branch 'master' into proc_struct2

93c67a7

Fixed formatting

4a520e6

rafalcieslak added 5 commits May 4, 2017 13:33

Terminating other threads on 'exec'

bbe8665

Set process parent on fork

063dd73

Removed redundant critical_leave

c995193

Introduced the assumption of simgle thread per process

666e97d

Fixed formatting

11da4e0

cahirwpz reviewed May 11, 2017

View reviewed changes

rafalcieslak added 2 commits May 11, 2017 12:18

Merge branch 'master' into proc_struct2

e89a719

Dropped a critical section in vm_map_clone

6ca0181

cahirwpz merged commit a8f0b4b into cahirwpz:master May 11, 2017

rafalcieslak mentioned this pull request May 16, 2017

Process structure #278

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Moved uspace field from thread_t to proc_t #289

Moved uspace field from thread_t to proc_t #289

rafalcieslak commented May 2, 2017

cahirwpz May 2, 2017 •

edited

Loading

rafalcieslak May 2, 2017 •

edited

Loading

cahirwpz May 2, 2017

cahirwpz commented May 3, 2017

rafalcieslak commented May 3, 2017

cahirwpz commented May 4, 2017

rafalcieslak commented May 4, 2017

cahirwpz May 11, 2017

rafalcieslak May 11, 2017 •

edited

Loading

cahirwpz May 11, 2017

Moved uspace field from thread_t to proc_t #289

Moved uspace field from thread_t to proc_t #289

Conversation

rafalcieslak commented May 2, 2017

cahirwpz May 2, 2017 • edited Loading

Choose a reason for hiding this comment

rafalcieslak May 2, 2017 • edited Loading

Choose a reason for hiding this comment

cahirwpz May 2, 2017

Choose a reason for hiding this comment

cahirwpz commented May 3, 2017

rafalcieslak commented May 3, 2017

cahirwpz commented May 4, 2017

rafalcieslak commented May 4, 2017

cahirwpz May 11, 2017

Choose a reason for hiding this comment

rafalcieslak May 11, 2017 • edited Loading

Choose a reason for hiding this comment

cahirwpz May 11, 2017

Choose a reason for hiding this comment

cahirwpz May 2, 2017 •

edited

Loading

rafalcieslak May 2, 2017 •

edited

Loading

rafalcieslak May 11, 2017 •

edited

Loading