Merge Tags #678

inukshuk · 2022-05-30T08:34:36Z

inukshuk · 2022-07-21T08:36:25Z

src/models/tag.js

+    let itemsWithOldTags = new Set()
+    // find all item ids which have taggings of tags to be deleted
+    for (const id of mergeIds) {
+      itemsWithOldTags.add(...await this.items(db, id))


Using this here and below is potentially dangerous. It will commonly happen that these helpers are invoked directly and not via the common object used as the default export. We may also change the way we export these model functions.

is there another way to access that function without using this? I don't understand the JS export system well enough to properly understand this issue

The issue itself is not so much about the module/export system (but it's likely to be exposed because of it).

The problem is that this is only bound when the function is invoked (unless it's a bound or an arrow function!). If we reference this.items here, this may or may not work depending on how the function is called. The way use these functions makes it very likely that we'll have to pass them around to be invoked later, which would likely break. For example:

let a = { f() { return this.g() }, g() { return '!' } } a.f() //-> '!' setTimeout(a.f, 25) //-> This will fail. It's similar to: let f = a.f f() //-> Which also fails.

It's very common that we have to pass these type of functions around (e.g., when we use them in Commands). The module/export dynamics also play a role, because named exports are the norm with ESM we may want to switch to named exports going forward so that we can import { merge } from './models/tags.js'.

In summary, you can easily address this by hoisting the items function similar to the load function (which is referenced by create). If we want to export all functions individually that's probably what we should do anyway, for all the model functions, but then we probably need address a few naming issues (e.g., the 'delete' function would clash with the keyword; 'items' in isolation is probably too cryptic etc..). Another alternative would be to still keep everything in a namespace object like we currently do, but also store it in a local variable for referencing purposes.

In reference to the earlier example, of course setTimeout(a.f.bind(a), 25) would work, but it's not good if we always need to remember when to bind functions and when it's safe to pass them around.

inukshuk · 2022-07-21T08:38:00Z

src/models/tag.js

+    await this.delete(db, mergeIds)
+
+    // recreate taggings for items which had deleted tags
+    for (const item of itemsWithOldTags) {


Lets fold the loop into the DB command. This way we should be able to send a single insert command for all items to the DB instead of one command for each item.

inukshuk · 2022-07-21T08:49:14Z

src/models/tag.js

+          ...into('taggings')
+            .insert({ tag_id: keepId, id: item }))
+      } catch (e) {
+        // TODO is there a nicer way to insert if not exists?


Yes, there should be! We can probably use REPLACE INTO which is just a SQLite shorthand for INSERT OR REPLACE INTO. But maybe it would be better to use ON CONFLICT IGNORE because REPLACE will remove the old tagging first and we probably want to keep the original? (Our 'query generator' classes are far from complete; if we use this here we can add a way to set e.g. onConflict('ignore') on the insert statement)

would INSERT OR IGNORE INTO be appropriate here? I didn't know SQLite had all these extra options around insert

Yes, I think INSERT OR IGNORE fits perfectly here. It applies to primary key constraints so we can use that instead of handling the error, or checking each tag/item combination first.

inukshuk · 2022-07-21T08:52:04Z

src/models/tag.js

+            .insert({ tag_id: keepId, id: item }))
+      } catch (e) {
+        // TODO is there a nicer way to insert if not exists?
+        console.error(e)


We must be careful not to swallow real errors here (so make sure to throw). Because this function should be called with a transaction, we can be sure everything will be rolled back if we the function throws an error.

inukshuk · 2022-07-21T08:57:43Z

test/models/tag_test.js

+    let db
+
+    mkdbtmp(x => db = x,
+      'db_test.sqlite', projectModel.create, { name: 'Test Project' })


Here is a good illustration about the potential dangers of using this in the model functions. If we use this in create here it will fail.

inukshuk · 2022-07-21T09:13:44Z

test/models/tag_test.js

+    mkdbtmp(x => db = x,
+      'db_test.sqlite', projectModel.create, { name: 'Test Project' })
+    // required to set up database schema
+    // TODO is there a better way to do this?


I think this is fine, but two things come to mind: I'd be interested to explore using an in-memory db for such kind of tests, because they're almost certainly going to be faster and we won't have to clean up the file afterwards. If I'm not mistaken then you should be able to call Database.create with path ":memory:" to create an in-memory db. We could adjust the mkdbtmp helper to skip any file related actions if that's the path.

The second thought was that we could integrate this into the fixture helpers maybe.

let p = F.project.create()

And this works sort of like mkdbtmp in that it adds before and after hooks which create the and close the db connection accessible under p.current (sort of like React's refs). In fact we should change mkdbtmp to also use that pattern then we can skip the callback and the tests are much easier to read. What do you say?

inukshuk · 2022-07-29T17:04:44Z

I've explored re-writing the mkdbtmp helper to return a React-like ref object, with the before/after hooks managing its current value. I think this way it's more straightforward to use, although having to type .current everywhere is somewhat annoying. What do you think about this?

I've also changed some of the tests in the process where I thought it would be useful. Most importantly probably, when you test for exceptions or rejections always use the .throws() or .rejected / .rejectedWith() matchers: that should be preferred over adding try/catch blocks to the tests themselves.

I may have made some other instructive changes -- just let me know if you anything catches your eye.

The most confounding thing though, is that I broke your merge tests and I haven't figured out why yet. The failure is so bizarre that I wonder if it's due to some underlying bug so I committed this with the failure for now hoping that it will be instructive for us to figure out what's going on.

Here is what I know so far:

The before hook that inserts tags and items fails. The error is caused by the trigger that adds metadata values to the full-text index. Specifically, it looks like the fts_metadata virtual table does not exist. At first I assumed this may have to do with the tests using an in-memory db, but the error is the same when I switch to using a temporary file. Next, I figured that there was an issue with the db-pool abstraction, that somehow we were opening a connection to an empty db. But I think I can rule that out too, when sending consecutive queries over the same connection in that before handler I see that fts_metadata exists in the sqlite_schema table, but querying e.g. select * from fts_metadata limit 1 still fails -- which is quite baffling.

inukshuk · 2022-07-30T10:51:48Z

OK I don't know yet why this fails, but I have isolated the issue now: it happens when you load the project schema and then keep using the db connection without closing it. Previously mkdbtmp used Database.create which created the db and closed it right away. This wasn't really on purpose originally, we just re-used the Database.create script for the test helper the way it was. So it looks like this was an unintentional workaround for an issue with our db schema we didn't know existed.

inukshuk · 2022-07-30T11:04:51Z

I think this is because virtual tables are registered per connection. When we restore the schema, the virtual table is added to the sqlite_schema, but probably not registered with the connection. We need to find out if there is a better way to dump and restore virtual tables; if there isn't that really rules out in-memory dbs if virtual tables are used. (A workaround would be to run all migrations instead to re-create the schema)

inukshuk · 2022-07-30T14:03:23Z

Seems like there's a simple solution!

inukshuk · 2022-07-30T16:28:24Z

OK I rebased the branch again and added some minor changes to make the tests more concise and this should all be working again now. The jury is still out on the new mkdbtmp helper: that we can use in-memory DBs now is definitely a plus which should definitely stay. I'm not fully convinced of using the React-style ref syntax. I do feel that it's much easier to understand because it works without the callback; on the other hand .current is a bit verbose. We could pick a shorthand of course, I don't know.

It also looks like these DB tests fail consistently on the Windows CI again. Interestingly also the tests using the in-memory DB are failing so contrary to my priors, this might not be a file system issue after all!

Sharp rebuilds recently started failing with this option.

inukshuk force-pushed the feature/merge-tags branch from 71c33cd to 0591aef Compare June 23, 2022 13:20

inukshuk force-pushed the feature/merge-tags branch 3 times, most recently from 2b15c92 to 13c989c Compare July 14, 2022 08:58

inukshuk mentioned this pull request Jul 18, 2022

Merge Tags #673

Open

inukshuk commented Jul 21, 2022

View reviewed changes

inukshuk force-pushed the feature/merge-tags branch from eda813e to fc053b3 Compare July 29, 2022 11:03

inukshuk force-pushed the feature/merge-tags branch from 8726add to e734467 Compare July 30, 2022 15:25

inukshuk and others added 7 commits July 30, 2022 17:42

Add separate keymap for TagList in TagPanel

85dfe98

Add onEdit prop to TagList

a67996b

Fix Sidebar keymap inheritance

bcc0e07

WIP: merge tags function, tests

1f51dd9

Simplify mkdbtmp helper

c6290f3

Keep single connection for memory db pool

3db10db

Update tests to use new mkdbtmp helper

e755cf8

inukshuk force-pushed the feature/merge-tags branch from e734467 to e755cf8 Compare July 30, 2022 15:42

inukshuk added 2 commits July 30, 2022 18:18

Simplify item model usage

48be28e

Avoid using RDF ids in tests

d5d26ae

Do not default to parallel builds

7c92871

Sharp rebuilds recently started failing with this option.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Merge Tags #678

Merge Tags #678

inukshuk commented May 30, 2022 •

edited

inukshuk Jul 21, 2022

caro401 Sep 7, 2022

inukshuk Sep 8, 2022

inukshuk Sep 8, 2022

inukshuk Jul 21, 2022

inukshuk Jul 21, 2022

caro401 Sep 7, 2022

inukshuk Sep 8, 2022

inukshuk Jul 21, 2022

inukshuk Jul 21, 2022

inukshuk Jul 21, 2022

inukshuk commented Jul 29, 2022

inukshuk commented Jul 30, 2022

inukshuk commented Jul 30, 2022

inukshuk commented Jul 30, 2022

inukshuk commented Jul 30, 2022

Merge Tags #678

Are you sure you want to change the base?

Merge Tags #678

Conversation

inukshuk commented May 30, 2022 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

inukshuk commented Jul 29, 2022

inukshuk commented Jul 30, 2022

inukshuk commented Jul 30, 2022

inukshuk commented Jul 30, 2022

inukshuk commented Jul 30, 2022

inukshuk commented May 30, 2022 •

edited