
Committing persistent server data


If I were to keep the data in a file on disk and the data is unsorted, wouldn't it take too long to query what I need from that file?


If you don't load everything into memory, but keep it all in an unstructured file on disk, then yes, that is kind of the slowest of both worlds :-)

If you save/load pieces, then you typically put each piece in some kind of database. This can be something heavyweight like MySQL or DB/2, something simple like Berkeley DB or just a file with a table-of-contents index at the front, or something super-simple like a file per user/entity (with some recursive directory hierarchy to avoid 100,000 files in one directory.)
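For the file-per-entity route, here's a minimal sketch in C of that directory hierarchy (the layout and names are invented for illustration): derive a couple of directory levels from the entity id so that no single directory accumulates 100,000 files.

/* Sketch: one file per entity, sharded into subdirectories by id.
   The layout is hypothetical -- adapt it to your own data. */
#include <stdio.h>

/* Two directory levels derived from the low bits of a 64-bit entity id
   keep any single directory from holding too many files. */
static void entity_path(char *buf, size_t len, unsigned long long id)
{
    unsigned level1 = (unsigned)(id        & 0xff);
    unsigned level2 = (unsigned)((id >> 8) & 0xff);
    snprintf(buf, len, "data/%02x/%02x/%016llx.dat", level1, level2, id);
}

int main(void)
{
    char path[128];
    entity_path(path, sizeof path, 0x4af31c22ULL);
    printf("%s\n", path);  /* prints data/22/1c/000000004af31c22.dat */
    return 0;
}

The two hex levels give you 65,536 buckets, so even millions of entities end up as a few dozen files per directory.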

Also, if memory usage isn't a concern, is it still a bad idea to keep all data in memory?


If the performance is acceptable (maybe you can save the data asynchronously?) and you have no better use for the RAM, then that's fine.
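To make the "save asynchronously" aside concrete, here's a rough sketch using the Win32 API (since you mentioned Windows). Error handling is omitted, and PlayerState/g_players are hypothetical stand-ins for your real state: copy a snapshot, then let a worker thread write it and swap the file into place.

/* Sketch: snapshot game state and write it on a worker thread (Win32).
   Error handling omitted; PlayerState and g_players are hypothetical. */
#include <windows.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

typedef struct { int id; int gold; } PlayerState;  /* stand-in for real state */
static PlayerState g_players[1024];

static DWORD WINAPI save_thread(LPVOID arg)
{
    PlayerState *snapshot = arg;
    FILE *f = fopen("players.sav.tmp", "wb");
    fwrite(snapshot, sizeof(PlayerState), 1024, f);
    fclose(f);
    /* Swap the finished file into place so readers never see a half-write. */
    MoveFileExA("players.sav.tmp", "players.sav", MOVEFILE_REPLACE_EXISTING);
    free(snapshot);
    return 0;
}

void begin_async_save(void)
{
    /* Copy under whatever lock protects g_players, then write off-thread. */
    PlayerState *snapshot = malloc(sizeof g_players);
    memcpy(snapshot, g_players, sizeof g_players);
    CloseHandle(CreateThread(NULL, 0, save_thread, snapshot, 0, NULL));
}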

Another option you may want to consider is memory mapping the file, which keeps the file saved automatically without you having to worry about it.
mmap() on Linux; CreateFileMapping() on Windows.
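Here's a minimal sketch of the Windows flavor, with all error checking omitted (a real server would verify every handle). Writes to the mapped bytes end up in the file, and FlushViewOfFile forces them out when you want a known-good point on disk:

/* Sketch: memory-map a fixed-size data file on Windows (no error checks). */
#include <windows.h>

#define DATA_SIZE (1 << 20)  /* 1 MB of persistent state */

int main(void)
{
    HANDLE file = CreateFileA("world.dat", GENERIC_READ | GENERIC_WRITE,
                              0, NULL, OPEN_ALWAYS, FILE_ATTRIBUTE_NORMAL, NULL);
    HANDLE mapping = CreateFileMappingA(file, NULL, PAGE_READWRITE,
                                        0, DATA_SIZE, NULL);  /* grows file to 1 MB */
    char *data = MapViewOfFile(mapping, FILE_MAP_ALL_ACCESS, 0, 0, DATA_SIZE);

    data[0] = 42;                     /* mutate persistent state in place... */
    FlushViewOfFile(data, DATA_SIZE); /* ...and force it to disk when needed */

    UnmapViewOfFile(data);
    CloseHandle(mapping);
    CloseHandle(file);
    return 0;
}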
enum Bool { True, False, FileNotFound };

Also, if memory usage isn't a concern, is it still a bad idea to keep all data in memory?


Eventually you'll run out. You'll have almost a thousand times more disk space than memory space, so you may as well use it where it makes sense. If you're using files anyway, there's no good reason to keep data in memory that you're not using, and may never use again (e.g. if a player never comes back).

.. or just a file with a table of contents index at the front ..

How would I go about implementing this? I couldn't find a way to insert or remove data at a certain offset in a file.

Also, hopefully without going off-topic: if I were to keep them in a file in sorted format and only load/commit the active data, how would I go about implementing it, especially the sorting part? Is there any source code I could look into?

Eventually you'll run out. You'll have almost a thousand times more disk space than memory space, so you may as well use it where it makes sense. If you're using files anyway, there's no good reason to keep data in memory that you're not using, and may never use again (e.g. if a player never comes back).

Are there any implementations you could recommend which load from / save to a sorted file? I'm using C and Windows, but anything that would help me grasp the concept would be helpful.

I don't think a single sorted file is the way to go, because it's not very practical. Hplus0603 already gave one good alternative - a file per user/entity/player. That way, the file system is your sorting mechanism.

If you really want it to be all in one file, with quick access to arbitrary parts of the file, then there's software that will do that for you, and it's called a database. Again, Hplus0603 mentioned some names above.


Thanks for the reply. I'll look into that alternative.

there's no good reason to keep data in memory that you're not using, and may never use again (e.g. if a player never comes back).


Shh! Don't tell the Redis folks!

(FWIW: We run the biggest single Redis instance I know of, with 768 GB of RAM. This turned out to be a mistake, because the entire kernel locks up for 10 seconds each time it forks to checkpoint the data.)

How would I go about implementing


At that point, you're building the lowest-level component of a database (or, for that matter, a file system), which is "indirect block allocation and management."
A very simple way to do that is to split your file into chunks, say 1 MB each, and have each chunk link to the next chunk when it gets full. To read all the data, you follow the chain of links and concatenate all the data.
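A sketch of reading such a chain in C (the on-disk format here is invented for illustration: each chunk begins with an 8-byte offset of the next chunk, with 0 marking the end of the chain):

/* Sketch: walk a chain of linked chunks (invented format). */
#include <stdio.h>
#include <stdint.h>
#include <stdlib.h>

#define CHUNK_SIZE (1 << 20)  /* 1 MB chunks */

void read_chain(FILE *f, long first_chunk)
{
    char *buf = malloc(CHUNK_SIZE);
    long offset = first_chunk;
    while (offset != 0) {
        uint64_t next;
        fseek(f, offset, SEEK_SET);  /* for files > 2 GB, use _fseeki64 */
        fread(&next, sizeof next, 1, f);            /* link to next chunk  */
        fread(buf, 1, CHUNK_SIZE - sizeof next, f); /* this chunk's payload */
        /* ...concatenate or process the payload here... */
        offset = (long)next;
    }
    free(buf);
}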
A slightly more sophisticated way is to make the first chunk an array of chunk offsets; each time you need another 1 MB chunk, add a new offset to the table, and when the table runs out, you either say "file is full," chain additional table chunks together as a linked list, or add a second layer of indirection.
(Chunk size varies by application -- 1 MB may be way too big or not big enough, depending on what you're doing.)
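Here's a rough sketch of that offset-table variant (again an invented format, with chunk 0 holding the table): to reach logical byte N of the data, index the table with N divided by the chunk size, then seek into that chunk.

/* Sketch: chunk 0 is a table of 64-bit chunk offsets (invented format).
   Logical byte N lives at table[N / CHUNK_SIZE] + (N % CHUNK_SIZE). */
#include <stdio.h>
#include <stdint.h>

#define CHUNK_SIZE  (1 << 20)                        /* 1 MB chunks             */
#define TABLE_SLOTS (CHUNK_SIZE / sizeof(uint64_t))  /* 128K offsets in chunk 0 */

/* Position the stream at logical offset 'pos'; returns 0 on success. */
int seek_logical(FILE *f, uint64_t pos)
{
    uint64_t slot = pos / CHUNK_SIZE;
    uint64_t chunk_offset;
    if (slot >= TABLE_SLOTS)
        return -1;                                   /* the "file is full" case */
    fseek(f, (long)(slot * sizeof(uint64_t)), SEEK_SET);
    if (fread(&chunk_offset, sizeof chunk_offset, 1, f) != 1 || chunk_offset == 0)
        return -1;                                   /* chunk not allocated yet */
    fseek(f, (long)(chunk_offset + pos % CHUNK_SIZE), SEEK_SET);
    return 0;                                        /* f now points at the byte */
}

With 1 MB chunks, that 128K-slot table addresses exactly the 128 GB worked out in the math example below.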
An even more sophisticated way of doing this is to structure your data in an ordered index -- at this point, you'll want to read up on B-trees, B*-trees, and other such structures, because you're well on your way to building your own database!

Simple math example:
Let's assume 1 MB chunks and 64-bit file offsets.
1 MB can fit 128K file-offset pointers. Each pointer references a 1 MB chunk of file data.
Maximum size of data stored in file: 128K * 1M == 128 GB of data.
enum Bool { True, False, FileNotFound };


The detailed answer is much appreciated. I'll read up on these and invest further time into building something similar.
