A New User Similarity System

Offline

Aug 2006

386

Yeah, zarin said it a bit better than I did.

Well put.

#dontcare

Jan 25, 2008 7:07 PM

#83

Sarix

Offline

May 2007

670

Another point I said in the IRC discussion of this and expanding on my paragraph 3:

If a user has only seen good anime and knows that they have only seen good then using the shared mean is better (100% compatibility makes sense), which is what you do in your case. However as most people do not know that some of the anime they have watched is not really good but they just like anime in general then the total mean will be better overall. Using the shared mean has it's advantage in some situations and will produce a more accurate result, but as most people do not really know what they are rating using the total mean will be more accurate overall, even if it means being less accurate for a few.

SarixJan 25, 2008 7:15 PM

Jan 25, 2008 7:15 PM

#84

Offline

Mar 2005

3807

aisakku said:
Kei: what did you think of further normalization?

hmm?

Jan 25, 2008 9:04 PM

#85

Offline

Oct 2006

507

heh. thanks for the clarifications. I guess for the majority of users the total mean might be better. Even for me, it'll still be a good measure of compatibility. Over time, it'll only get even more accurate.

Jan 25, 2008 11:18 PM

#86

Offline

Aug 2006

386

kei-clone said:

aisakku said:
Kei: what did you think of further normalization?

hmm?

aisakku said:
Edit 2: for anyone curious about a further normalized system:

#dontcare

Jan 26, 2008 9:16 AM

#87

Offline

Oct 2006

507

I like that normalization in that the score spread my vary more, but I don't believe formula is correct for usera < meana
For "D(a) = usera(usera-meana)/(meana-1)," let's say mean = 8, that would mean that both a score of 1 and a score of 7 gives the same answer -1 as the value relating to the score below their mean.

Jan 26, 2008 10:14 AM

#88

Offline

Mar 2005

3807

aisakku said:

kei-clone said:

aisakku said:
Kei: what did you think of further normalization?

hmm?

aisakku said:
Edit 2: for anyone curious about a further normalized system:

hmm...I see the intention here but I'm not sure how much benefit this will bring over just straight up taking the difference between the mean and the score. I can see an immediate result though that all the numbers will result in a lot closer to a zero value though, and not sure that's a good thing.

Jan 26, 2008 11:25 AM

#89

Offline

Aug 2006

386

kei-clone said:

aisakku said:

kei-clone said:

aisakku said:
Kei: what did you think of further normalization?

hmm?

aisakku said:
Edit 2: for anyone curious about a further normalized system:

hmm...I see the intention here but I'm not sure how much benefit this will bring over just straight up taking the difference between the mean and the score. I can see an immediate result though that all the numbers will result in a lot closer to a zero value though, and not sure that's a good thing.

Yeah that is true, plus I'm pretty sure xinil would not want to code all of that.

scorpedo said:
I like that normalization in that the score spread my vary more, but I don't believe formula is correct for usera < meana
For "D(a) = usera(usera-meana)/(meana-1)," let's say mean = 8, that would mean that both a score of 1 and a score of 7 gives the same answer -1 as the value relating to the score below their mean.

Ahh yeah i see, though that'd be fixed if you change it and use the lowest output score instead of 1 for the denominator.

Edit: Oh I see the flaw with it.... would have to change the formula to be something like: (meana-usera)(usera-meana)/(meana-alow) for meana > usera

(making these formulae up from scratch so thanks for pointing that out.)

That would also turn usera > meana into:

(usera-meana)(usera - meana)/(10-usera)

aisakkuJan 26, 2008 11:32 AM

#dontcare

Jan 27, 2008 10:53 PM

#90

Offline

Oct 2006

507

I just realized that at the 'grand totals' on each list, Xinil already displays each users deviation based on the average of all members. Can that somehow be used instead of the user's mean?
Just throwing that out there in case it's easier/more accurate that way; haven't put too much thought into it.

Jan 27, 2008 10:55 PM

#91

Offline

Mar 2005

3807

scorpedo said:
I just realized that at the 'grand totals' on each list, Xinil already displays each users deviation based on the average of all members. Can that somehow be used instead of the user's mean?
Just throwing that out there in case it's easier/more accurate that way; haven't put too much thought into it.

I thought about that before too. But never was able to come up with a formula.

Jan 28, 2008 8:56 PM

#92

Sarix

Offline

May 2007

670

Here's some random comparisons between me and other MAL users:

Looking through it there really isn't a huge variety in similarities, which is a bad thing as it's kinda hard to look at the number and now exactly how similar you are. Adding text may help but with that small of difference it will be hard to properly correlate correct words to the scores.

Jan 29, 2008 7:53 AM

#93

Offline

Aug 2006

386

Zarin said:
Here's some random comparisons between me and other MAL users:

Looking through it there really isn't a huge variety in similarities, which is a bad thing as it's kinda hard to look at the number and now exactly how similar you are. Adding text may help but with that small of difference it will be hard to properly correlate correct words to the scores.

Again, the numbers for mine are currently off and should not be trusted. Both systems are going to differentiate from the original, as well as from each other,

The output from my system would be read the same as the previous, with 0 being the best possible score and 9 being the worst. It's possible to convert mine into percentages as well, but for now I believe we're mostly looking for input

#dontcare

Feb 1, 2008 4:00 PM

#94

Xinil

Overlord

Offline

Nov 2004

5752

I think, unless there's a large amount of disagreement, I'm just going to go with abh's percentage correlation. Sure, it's not as great as aisakku's, but it's easier to implement, and offers a better 'visual' and 'immediate' representation. If you guys disagree and want to get a poll going, go for it. I just want some movement on this.

Feb 1, 2008 5:09 PM

#95

Offline

Mar 2005

3807

sounds good to me

Feb 1, 2008 5:36 PM

#96

Offline

Aug 2006

386

My only real disagreements have been stated. His system is better than the current, so go for it.

#dontcare

Feb 1, 2008 5:53 PM

#97

Offline

Oct 2006

507

aye, abh's system is fine with me.

Feb 2, 2008 5:37 AM

#98

cyruz

Anime DB Admin

BACK FOR MORE?

Offline

Jan 2007

12974

No objections.

staff.applications　▼　
guidelines.faq　▼　

report.abuse　▼　

thx.skittles　▼　
thx.kina　▼　

[H+] ³　▼　

Feb 4, 2008 4:06 PM

#99