RF Generation.  The Classic and Modern Gaming Databases.RF Generation.  The Classic and Modern Gaming Databases.

Posted on Mar 18th 2025 at 01:50:09 AM by (Misto)
Posted under devlog

Hello, its time for the second entry discussing progress on the RFG rebuild.  I know its been a little quiet on the front page in this regard but work is progressing.  This time I'm mostly going to focus on the database work that is ongoing.

As I mentioned in the previous dev log, we have a large amount of data that we are planning to transform into a new structure to allow us more flexibility in adding game data.  However, this means we need to take all the existing data and move it to a new database.  This isn't a simple task since we are not performing a one-to-one migration.  A lot of the data needs to be cleaned up and reformed into the new structure.  The first hurdle is fixing the developers and publishers fields.  If you've ever added an entry to the site, you know that those fields are free form text - meaning you can put any text you want in there.  The problem that has is we have a lot of duplicate entries, spelling errors, and different styles people have used.  Take, for instance, a game with two developers (A and B) - I've seen all of the following different styles in the DB:

  • A / B
  • A & B
  • A and B
  • A with B
  • A, B
  • A; B
  • A (B)
  • A (for PC) / B (for Mac)

Not to mention duplicate entries (not including typos) for different company designations

  • A, Ltd.
  • A Ltd
  • A, Ltd
  • A Ltd.
  • A Limited

This makes it very hard to migrate into a "Company" table to normalize and very hard to manually go through the data to "fix" it.  We have nearly 15000 different unique entries just for companies.  I am slowly going through and fixing these to populate a new table that we can tie games and hardware to.  We also want to pull out all of the single person names to populate a "person" table that can be used for game credits as well as developer/publisher fields. The good news is once its done, we won't have to worry about duplicates or typos going forward.  When submitting a game, you'll just need to select the company from the existing values (don't worry there will be a way to add companies as well if needed).  Further, since its hard to verify that there still won't be duplicates after this, I'm planning on adding a "merge companies" function for site admins in case there were mistakes in the migration.

Once the companies are migrated, there will be a few smaller tables that will need the same treatment such as controller and genre fields.  Then a more programmatic approach can be taken to migrate all the actual games and hardware and tie them to new entries for all the mentioned fields - standardizing all the data.  This data work will also allow us to start building concrete APIs and UI mockups to move forward which will be much more tangible for most people reading this.

Anyway that's all for this one, I hope it was a fun read.  Once more progress is made, I imagine entry 3 will discuss some more of the data structure and API work that will need to happen to move forward.


Permalink | Comments [10] | Digg This Article |


Recent Entries
RFG Dev Log #3 (4/9/2025)
Thoughts on Everhood 2 (4/2/2025)
RFG Dev Log #2 (3/18/2025)
A Quick Update (3/3/2025)
Game Room Project - Feb 2025 - The Arcade Cabinet (2/11/2025)


Comments
 
Thanks for the update!  I'm so grateful you are heading up this task, as the workload is obviously intimidating.  Also a thanks to everyone volunteering to help get the database cleaned up once we get to a further point for access.

And to the folks volunteering contributions for the donations raffle, I assure you guys the reason we haven't moved that forward is due to complications with the server account transfer (which we are still getting sorted out.)

I know this process has gone from weeks to months and beyond, so much so that I've stopped repeating what I thought was around the corner.  But we are indeed getting there, inch by inch (or centimeter by centimeter, for you weird folks that like sensible measuring metrics.)

Smiley
 
You are doing god's work. In the end, it will be amazing to have these fields standardized.
 
This sounds like a monumental task to clean up the listings so they're more uniform. Your work is greatly appreciated!
 
I don't know if a certain habit of mine has contributed to all this "extra" text that has to be dealt with, but I swear it comes from a good place. I've submitted alternative titles to games already in the database in order to help users find them more easily, mainly due to the fact that I used certain terms in a search that would either reult in "no games in our database match your query," or some of the games in that series would show up and others wouldn't, including the one I was llooking for.

For example, I was looking to add a Star Fox (two words) title by searching using the term "StarFox" (one word). While some of the Star Fox titles would come up, the particular one I was looking for wasn't. So I did another search separating the words, and many of the other titles would show up in the results, but others were still absent. So I went through every StarFox/Star Fox title in the database, and submitted alt titles "Star Fox" to StarFox entries, and "StarFox" to Star Fox entries.

So... have my efforts to make database searching a little more easy for my fellow users resulted in making much needed changes to the site more difficult? Oh, the crushing guilt...
 
@Zagnorch P. Welinskivich II, Esq.:

Hey, I agree with you, the search sucks.  I've searched for titles in many ways and couldn't find them.  I'd submit a new submission and I'd got a message back on the website that said my submission has been denied because it's already in the database with a link to it.  When I inquired how they found it/searched for it, I didn't really get a reponse answering my question.
 
Hi. Havent really been around for a while. Still got my collection, but I kind of ran out of room and really cut back on buying. So have become a lot less involved in the scene.

I'm really glad though to see the site is getting a good facelift. Some of the ideas being proposed I thought would have been cool back when I was involved. Its going to be great when it's done.

Good luck with the rebuild!
 
@Zagnorch P. Welinskivich II, Esq.:

Don't worry, the alternate names are not affecting this. The alternate titles should just be listed as an alias in the root game for searching purposes.
 
@Misto: Well that's good to hear. Searching for a fairly common title that should be on the database and not finding it immediately has always been my biggest frustration about this place. You know, first-world problems and all that. But at least RF Gen empowers me to submit search terms to prevent future users from suffering similar frustrations. I can't think of any other online places that give its community such empowerment, which is one reason this has been my fave online hangout for many years.

On another note: the greatest miracle you could ever pull off in the renovation of the site is the restoration of all the forum posts between late March 2009 to early February 2012 that were lost in the Great Outage. Three years history, gone. It's the internet equivalent of the Library of Alexandria fire to be sure. That having been said, I hope the site renovation is robust enough to not suffer another such loss in the future.

More power to you all...
 
@Zagnorch P. Welinskivich II, Esq.:

Yea, I'm hoping to really improve search results & capabilities with the restructure.  I did a migration of all the company data a few days ago (what this post was talking about) and while its probably not 100% perfect at removing duplicates and typos, its certainly better.

The big point however, is I added database support for this table (and others when I get to them) to perform full text searching.  Basically we'll be able to search against multiple data points much easier.  On companies - name, aliases, and description will be included.  I'll talk more about this in the next dev log.

As for the forum posts that were lost, I'll have to do some digging.  I was around during that time but wasn't active very much on the forums so I don't remember that issue.  If the posts are not in the current databases, however, I'm afraid they are lost for good. I'll see if I can find anything though.
 
Super glad to see all this effort, and thanks for all your hard work!

 Login or register to comment
It appears as though you are not a member of our site, or are not logged in.
It appears as though you can not comment currently. Becoming able to comment though is easy! All you need to do is register for the site! Not only will you be able to access any other site features including the forum and collection tools. If you are a registered user and just need to login then you can do so here.

Comment! It's easy, thoughtful, and who knows you might just enjoy it!
                                                                                                                                                                                                                                               
Login / Register
 
 
Not a member? Register!
Database Search
Site Statistics
Total Games:
152109
Total Hardware:
11278
Total Scans:
184475
Total Screenshots:
83352
[More Stats]
Our Friends
Digital Press Video Game Console Library NES Player The Video Game Critic Game Rave Game Gavel Cartridge Club Android app on Google Play RF Generation on Discord
Updated Entries
United States
(SW)

United States
(PS5)

United States
(PS5)

United States
(PS4)

United States
(PS4)

United States
(XBS)

North America
(PS5)

Spain
(PC)
Updated Collections
New Forum Topics
New on the Blogs
Nielsen's Favorite Articles

Site content Copyright © rfgeneration.com unless otherwise noted. Oh, and keep it on channel three.