Draft: Feat/glom like key searching by mkrd · Pull Request #44 · mkrd/DictDataBase

mkrd · 2022-11-23T22:51:42Z

Things left to do:

Update docs

mkrd · 2022-11-23T22:52:43Z

tests/benchmark/run_parallel.py

-		res = f"✨ Scenario: {'🔹' * self.readers}{'🔻' * self.writers} ({self.readers}r{self.writers}w)"
-		res += ", 🔸 compression" if self.use_compression else ""
-		res += ", 💎 big file" if self.big_file else ""


I will put this in a different PR to keep this one clean.

mkrd · 2022-11-27T15:28:03Z

dictdatabase/indexing.py

-	"""
-		The Indexer takes the name of a database file, and tries to load the .index file
-		of the corresponding database file.
-
-		The name of the index file is the name of the database file, with the extension
-		.index and all "/" replaced with "___"
-
-		The content of the index file is a json object, where the keys are keys inside
-		the database json file, and the values are lists of 5 elements:
-		- start_index: The index of the first byte of the value of the key in the database file
-		- end_index: The index of the last byte of the value of the key in the database file
-		- indent_level: The indent level of the key in the database file
-		- indent_with: The indent string used.
-		- value_hash: The hash of the value bytes
-	"""
-
-	__slots__ = ("data", "path")
-
-	def __init__(self, db_name: str):
-		# Make path of index file
-		db_name = db_name.replace("/", "___")
-		self.path = os.path.join(config.storage_directory, ".ddb", f"{db_name}.index")
-
-		os.makedirs(os.path.dirname(self.path), exist_ok=True)
-		if not os.path.exists(self.path):
-			self.data = {}
-			return
-
-		try:
-			with open(self.path, "rb") as f:
-				self.data = orjson.loads(f.read())
-		except orjson.JSONDecodeError:
-			self.data = {}
-
-
-	def get(self, key):
-		"""
-			Returns a list of 5 elements for a key if it exists, otherwise None
-			Elements:[start_index, end_index, indent_level, indent_with, value_hash]
-		"""
-		return self.data.get(key, None)
-
-
-	def write(self, key, start_index, end_index, indent_level, indent_with, value_hash, old_value_end):
-		"""
-			Write index information for a key to the index file
-		"""
-
-		if self.data.get(key, None) is not None:
-			delta = end_index - old_value_end
-			for entry in self.data.values():
-				if entry[0] > old_value_end:
-					entry[0] += delta
-					entry[1] += delta
-
-		self.data[key] = [start_index, end_index, indent_level, indent_with, value_hash]
-		with open(self.path, "wb") as f:
-			f.write(orjson.dumps(self.data))


@UmbrellaMalware let's stick with tabs so the diff looks clean

PEP 8 says spaces are preferred. This can become a problem if someone else wants to propose changes to the project.

Yes, I know. I personally prefer tabs and think they are faster to work with. But if this project becomes bigger, I agree that it should adhere to most Python community standards. I'd like to keep the merge clean, though: the indentation and formatting changes should be part of a separate issue/MR.
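
For readers following the removed `Indexer` code above, here is a minimal in-memory sketch of the offset bookkeeping its `write` method performs: when a key's value is rewritten and its length changes, every entry that starts after the old value is shifted by the difference. The names `IndexEntry` and `store_and_shift` are hypothetical and not taken from the PR, and file I/O is left out on purpose.

```python
from dataclasses import dataclass


@dataclass
class IndexEntry:
    """Hypothetical record mirroring the 5-element list in the removed docstring."""
    start_index: int
    end_index: int
    indent_level: int
    indent_with: str
    value_hash: str


def store_and_shift(entries, key, new_entry, old_value_end):
    """Store new_entry under key, shifting entries located after the old value.

    Mirrors the loop in the removed Indexer.write: the shift is only applied
    when the key was already indexed.
    """
    if key in entries:
        # How far the end of the rewritten value moved in the file.
        delta = new_entry.end_index - old_value_end
        for other in entries.values():
            if other.start_index > old_value_end:
                other.start_index += delta
                other.end_index += delta
    entries[key] = new_entry
    return entries


entries = {
    "a": IndexEntry(5, 10, 1, "\t", "hash-a"),
    "b": IndexEntry(20, 30, 1, "\t", "hash-b"),
}
# The value of "a" grows by 4 bytes, so "b" moves 4 bytes further into the file.
store_and_shift(entries, "a", IndexEntry(5, 14, 1, "\t", "hash-a2"), old_value_end=10)
```

After the call, the entry for `"b"` covers bytes 24 to 34, while `"a"` keeps its start and gets the new end.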

mkrd · 2022-11-27T15:37:32Z

@UmbrellaMalware I did a quick rebase to incorporate the latest changes in main

UmbrellaMalware · 2022-11-27T15:47:52Z

> @UmbrellaMalware I did a quick rebase to incorporate the latest changes in main

I need to test the new functionality. I just found a bug in the search algorithm :)

I'm also doing some documentation updates.

mkrd · 2022-11-27T19:46:38Z

> > @UmbrellaMalware I did a quick rebase to incorporate the latest changes in main
>
> I need to test the new functionality. I just found a bug in the search algorithm :)
>
> I'm also doing some documentation updates.

Do you mean the issue with reading a key value that is an integer or float?

UmbrellaMalware · 2022-11-27T19:48:59Z

> > > @UmbrellaMalware I did a quick rebase to incorporate the latest changes in main
> >
> > I need to test the new functionality. I just found a bug in the search algorithm :)
> > I'm also doing some documentation updates.
>
> Do you mean the issue with reading a key value that is an integer or float?

No. If you try to read "Key.Subkey1.Subkey2", it returns not found, even though Subkey2 exists.
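
To make the expected behaviour concrete, here is a minimal sketch of glom-like lookup by a dotted key path. It is only an illustration on an already parsed dict, not the PR's search algorithm, which works on the database file itself. The function name `find_by_path` and the `(found, value)` return shape are assumptions made for this example.

```python
import json


def find_by_path(data, path, sep="."):
    """Follow a dotted key path through nested dicts.

    Returns (True, value) if every path segment exists, else (False, None).
    A tuple is used so that a stored null can be told apart from a missing key.
    """
    current = data
    for part in path.split(sep):
        if not isinstance(current, dict) or part not in current:
            return False, None
        current = current[part]
    return True, current


doc = json.loads('{"Key": {"Subkey1": {"Subkey2": 42}}}')
found, value = find_by_path(doc, "Key.Subkey1.Subkey2")
missing, _ = find_by_path(doc, "Key.Other.Subkey2")
```

With this document, the first lookup finds `Subkey2` and the second reports the path as missing. A regression test for the bug reported above could assert the same for a path three levels deep.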

Danil Tolmachev added 11 commits November 27, 2022 16:35

add glom like searching

0a49406

reformat imports

c83342c

fix type hinting

e75f8eb

add tests

66a6a41

rename Searcher -> KeySearcher

325d3c7

add partial write

e856f71

fix print compatibility

0a105d9

renaming

a1c544b

add negative test

21b8a9f

add dataclass for searching

456aef2

add Index dataclass

4d0c4b8

mkrd force-pushed the feat/glom-like-key-searching branch from 25989d2 to 4d0c4b8 Compare November 27, 2022 15:36

mkrd added 2 commits November 27, 2022 16:39

use tabs

f53b456

revert docstring indent

d5499a0

mkrd changed the title ~~Feat/glom like key searching~~ Draft: Feat/glom like key searching Jul 25, 2023

mkrd force-pushed the main branch from 15308e3 to e3d6e83 Compare November 15, 2024 08:40

mkrd wants to merge 13 commits into main from feat/glom-like-key-searching

2 participants
