Add (Int, +) finger trees #766

treeowl · 2021-02-03T21:37:59Z

Export a type for (Int,+) finger trees.
Export more Data.Sequence internals.
Offer a module of Data.Sequence internals intended
for external use, that should obey the PVP.
Remove Functor and Traversable instances for FingerTree and Node.
Remove the Generic1 instance for FingerTree.

treeowl · 2021-02-03T21:44:08Z

@emilypi and @Lysxia, I'd especially appreciate if one of you would check my documentation for the splitting functions in Data.FingerTree.IntPlus.

sjakobi · 2021-02-03T21:52:13Z

@treeowl Could you say something about the motivation for this PR? Ideally create an issue, so we can discuss the "problem" separately from this implementation.

Lysxia · 2021-02-03T21:52:31Z

containers/src/Data/FingerTree/IntPlus.hs

+uncheckedSplit :: Sized a => Int -> FingerTree a -> UncheckedSplit a
+uncheckedSplit i ft
+  | S.Split l m r <- S.splitTree i ft
+  = UncheckedSplit l m r


Instead of this partial function, why not only export split and let users deal with partial patterns if they want?

My only concern is whether that extra stuff will inline away. Certain functions are extremely sensitive to the performance of splitting tiny little sequences (e.g., 2–5 elements) where any little extra time/allocation can matter a lot. I'm not sure how hard it would be to rejigger Data.Sequence.Internal.splitTree to make sure that works out okay.

treeowl · 2021-02-03T22:02:55Z

@treeowl Could you say something about the motivation for this PR? Ideally create an issue, so we can discuss the "problem" separately from this implementation.

What really triggered it for me was wanting to play with incremental quicksort for sequences. Doing that (reasonably) efficiently requires use of FingerTree (Seq a) (which is a lot like Seq (Seq a), but with different annotations and therefore different splitting behavior). And the simplest (not simple) approach requires something like the splitMap function, which we already use to implement zipWith and chunksOf.

In principle, it would be nice to expand from (Int, +), to any Int-based monoid. But I'm scared of the efficiency considerations of doing that, and I don't know of any reasonably nice interface that doesn't lean on the extremely non-portable Coercible machinery. That is, one could imagine

data FingerTree s a
class Sized s a | a -> s where size :: a -> s
(<|) :: (Coercible s Int, Monoid s, Sized s a) => a -> FingerTree s a -> FingerTree s a

but then we're relying on an additional specialization along with all that non-portable stuff. Youch. So I figured I'd start with something conservative.

* Export a type for `(Int,+)` finger trees. * Export more `Data.Sequence` internals. * Offer a module of `Data.Sequence` internals intended for external use, that should obey the PVP.

Lysxia · 2021-02-03T22:21:02Z

I was about to mention https://hackage.haskell.org/package/fingertree but if the goal is to implement "incremental quicksort" for Seq it does seem necessary to do this in containers and sufficient to keep it specialized to (Int, +).

treeowl · 2021-02-03T22:27:59Z

I was about to mention https://hackage.haskell.org/package/fingertree but if the goal is to implement "incremental quicksort" for Seq it does seem necessary to do this in containers and sufficient to keep it specialized to (Int, +).

Well... not strictly necessary, I don't think, but the Ints are unpacked here (but not there). That seems especially important for the very short sequences. Also, the fingertree package has fallen behind in some optimizations.

sjakobi · 2021-02-03T23:41:15Z

So, do you intend to offer this incremental quicksort from a different package? Otherwise I don't understand why we need to enhance the public / stable API for this.

Or are you possibly talking about different sequence types than Seq here?

treeowl · 2021-02-03T23:53:30Z

I don't even know if I'll be able to make that practical; it's just what got me here. I think another application is ropes: finger trees of byte arrays.

…

On Wed, Feb 3, 2021, 6:41 PM Simon Jakobi ***@***.***> wrote: So, do you intend to offer this incremental quicksort from a different package? Otherwise I don't understand why we need to enhance the public / stable API for this. Or are you possibly talking about different sequence types than Seq here? — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#766 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAOOF7M4NJQRRIZPDL4CEETS5HNKTANCNFSM4XBTXOVA> .

emilypi

This looks good on first pass!

Unless i'm missing something (i'm not super familiar with this library's internals), there are a few improvements we can make to ergnomics here. For example, I cannot fromList [1..10]. I must fromList $ Elem <$> [1..10], since Sized has only two instances. The ability to debug splits would also be welcome.

Thoughts?

emilypi · 2021-02-09T01:07:07Z

containers/src/Data/FingerTree/IntPlus.hs

+
+data Split a
+  = Split !(FingerTree a) a !(FingerTree a)
+  | EmptySplit


Do we have a debugging function that shows the split lying around? Would be useful to have.

treeowl · 2021-02-09T01:22:57Z

This looks good on first pass!

Unless i'm missing something (i'm not super familiar with this library's internals), there are a few improvements we can make to ergnomics here. For example, I cannot fromList [1..10]. I must fromList $ Elem <$> [1..10], since Sized has only two instances. The ability to debug splits would also be welcome.

Thoughts?

That fromList "limitation" is kind of essential to this structure. But we can add more Sized instances. I believe this PR adds one for Seq, we can get one for Array (and UArray?), and I guess inefficient ones for lists and non-empty lists too. Then there are potential derived instances for various base newtypes. Did you check my logic/arithmetic for splitting? And do you have a better name for split?

phadej

I cannot comment on this (not sure why you asked me to review). finger trees are unfamiliar structures to me.

phadej · 2021-07-28T12:01:52Z

containers/src/Data/Sequence/StableInternal.hs

+anything internal that is not exported, please file a GitHub issue.
+-}
+
+module Data.Sequence.StableInternal


Would Data.Sequence.Unsafe be better name?

treeowl requested review from sjakobi, Lysxia, phadej, ekmett, oisdk and emilypi February 3, 2021 21:38

treeowl force-pushed the seq-stable-internals branch from a5c4a0b to 1060462 Compare February 3, 2021 21:41

Lysxia reviewed Feb 3, 2021

View reviewed changes

treeowl force-pushed the seq-stable-internals branch from 1060462 to 5e746ca Compare February 3, 2021 21:53

treeowl force-pushed the seq-stable-internals branch from 5e746ca to 035e634 Compare February 3, 2021 22:08

Add (Int, +) finger trees

035e634

* Export a type for `(Int,+)` finger trees. * Export more `Data.Sequence` internals. * Offer a module of `Data.Sequence` internals intended for external use, that should obey the PVP.

emilypi reviewed Feb 9, 2021

View reviewed changes

phadej reviewed Jul 28, 2021

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add (Int, +) finger trees #766

Add (Int, +) finger trees #766

Uh oh!

treeowl commented Feb 3, 2021

Uh oh!

treeowl commented Feb 3, 2021

Uh oh!

sjakobi commented Feb 3, 2021

Uh oh!

Lysxia Feb 3, 2021

Uh oh!

treeowl Feb 3, 2021

Uh oh!

treeowl commented Feb 3, 2021 •

edited

Loading

Uh oh!

Lysxia commented Feb 3, 2021

Uh oh!

treeowl commented Feb 3, 2021

Uh oh!

sjakobi commented Feb 3, 2021

Uh oh!

treeowl commented Feb 3, 2021 via email

Uh oh!

emilypi left a comment

Uh oh!

emilypi Feb 9, 2021

Uh oh!

treeowl commented Feb 9, 2021

Uh oh!

phadej left a comment

Uh oh!

phadej Jul 28, 2021

Uh oh!

Uh oh!

Add (Int, +) finger trees #766

Are you sure you want to change the base?

Add (Int, +) finger trees #766

Uh oh!

Conversation

treeowl commented Feb 3, 2021

Uh oh!

treeowl commented Feb 3, 2021

Uh oh!

sjakobi commented Feb 3, 2021

Uh oh!

Lysxia Feb 3, 2021

Choose a reason for hiding this comment

Uh oh!

treeowl Feb 3, 2021

Choose a reason for hiding this comment

Uh oh!

treeowl commented Feb 3, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Lysxia commented Feb 3, 2021

Uh oh!

treeowl commented Feb 3, 2021

Uh oh!

sjakobi commented Feb 3, 2021

Uh oh!

treeowl commented Feb 3, 2021 via email

Uh oh!

emilypi left a comment

Choose a reason for hiding this comment

Uh oh!

emilypi Feb 9, 2021

Choose a reason for hiding this comment

Uh oh!

treeowl commented Feb 9, 2021

Uh oh!

phadej left a comment

Choose a reason for hiding this comment

Uh oh!

phadej Jul 28, 2021

Choose a reason for hiding this comment

Uh oh!

Uh oh!

treeowl commented Feb 3, 2021 •

edited

Loading