Forum Settings
Forums
New
Pages (3) « 1 2 [3]
Jan 25, 2008 6:58 PM
Offline
May 2007
670
scorpedo said:
hmm. I think I understand what you're saying, but the problem is that the total mean of a user should not imply what they think is average. I have a mean of 8.0, but the anime's I gave a 7 to I still consider good.
heh, I apologize if I'm not making sense or can't quite get the concept :X


Edit: Damnit, in the time I wrote this post it was already rebutted. >_<

If you watch 300 anime and rate every single one above 6 doesn't mean that they are all above average shows, it just means you like anime in general. If someone else has also watched 300 shows and likes the ones shared just as much as you do but takes into account their love for anime and uses the full rating range then the two should have a high compatibility.

Repeating what kei-clone said: Assuming the average for both users is 8 and 5 respectively then if both the users rated the show 7 then the second person actually likes it more than the second when looking at the whole scheme of things. If the second user rates a show as a 3 doesn't mean they absolutely hate it and have dropped it. I'm one of those people, I have rated the entire Shana series a 3 but I still enjoy it and I still plan to watch the entire series and any movies and specials that come out and if I was not using the full rating range like when I first joined I would have given it at least a 6.

Now using the shared mean is fine if both users share the majority of their shows, the real problem happens if they only share a little. Take for example if User A (The one with a high average and has watched 50 shows) shares 100% of their anime with User B (The one with a low average and has watched 300 shows) and both of them are given identical scores. Using the shared average they would have 100% compatibility, which doesn't make sense as... as... actually I see why you are saying use the shared mean and I don't think that there are any good examples that would show how using the total mean is any better.

I guess you really just have to look at what the scores mean themselves instead of a compatibility example (My paragraph 2). You may still consider a 7 as a good anime, but I still consider 4 as a good one too.
Jan 25, 2008 7:01 PM

Offline
Aug 2006
386
Yeah, zarin said it a bit better than I did.

Well put.
#dontcare
Jan 25, 2008 7:07 PM
Offline
May 2007
670
Another point I said in the IRC discussion of this and expanding on my paragraph 3:

If a user has only seen good anime and knows that they have only seen good then using the shared mean is better (100% compatibility makes sense), which is what you do in your case. However as most people do not know that some of the anime they have watched is not really good but they just like anime in general then the total mean will be better overall. Using the shared mean has it's advantage in some situations and will produce a more accurate result, but as most people do not really know what they are rating using the total mean will be more accurate overall, even if it means being less accurate for a few.
SarixJan 25, 2008 7:15 PM
Jan 25, 2008 7:15 PM

Offline
Mar 2005
3807
aisakku said:
Kei: what did you think of further normalization?


hmm?
Jan 25, 2008 9:04 PM

Offline
Oct 2006
507
heh. thanks for the clarifications. I guess for the majority of users the total mean might be better. Even for me, it'll still be a good measure of compatibility. Over time, it'll only get even more accurate.
Jan 25, 2008 11:18 PM

Offline
Aug 2006
386
kei-clone said:
aisakku said:
Kei: what did you think of further normalization?


hmm?


aisakku said:
Edit 2: for anyone curious about a further normalized system:


#dontcare
Jan 26, 2008 9:16 AM

Offline
Oct 2006
507
I like that normalization in that the score spread my vary more, but I don't believe formula is correct for usera < meana
For "D(a) = usera(usera-meana)/(meana-1)," let's say mean = 8, that would mean that both a score of 1 and a score of 7 gives the same answer -1 as the value relating to the score below their mean.
Jan 26, 2008 10:14 AM

Offline
Mar 2005
3807
aisakku said:
kei-clone said:
aisakku said:
Kei: what did you think of further normalization?


hmm?


aisakku said:
Edit 2: for anyone curious about a further normalized system:




hmm...I see the intention here but I'm not sure how much benefit this will bring over just straight up taking the difference between the mean and the score. I can see an immediate result though that all the numbers will result in a lot closer to a zero value though, and not sure that's a good thing.
Jan 26, 2008 11:25 AM

Offline
Aug 2006
386
kei-clone said:
aisakku said:
kei-clone said:
aisakku said:
Kei: what did you think of further normalization?


hmm?


aisakku said:
Edit 2: for anyone curious about a further normalized system:




hmm...I see the intention here but I'm not sure how much benefit this will bring over just straight up taking the difference between the mean and the score. I can see an immediate result though that all the numbers will result in a lot closer to a zero value though, and not sure that's a good thing.


Yeah that is true, plus I'm pretty sure xinil would not want to code all of that.


scorpedo said:
I like that normalization in that the score spread my vary more, but I don't believe formula is correct for usera < meana
For "D(a) = usera(usera-meana)/(meana-1)," let's say mean = 8, that would mean that both a score of 1 and a score of 7 gives the same answer -1 as the value relating to the score below their mean.


Ahh yeah i see, though that'd be fixed if you change it and use the lowest output score instead of 1 for the denominator.

Edit: Oh I see the flaw with it.... would have to change the formula to be something like: (meana-usera)(usera-meana)/(meana-alow) for meana > usera

(making these formulae up from scratch so thanks for pointing that out.)

That would also turn usera > meana into:

(usera-meana)(usera - meana)/(10-usera)
aisakkuJan 26, 2008 11:32 AM
#dontcare
Jan 27, 2008 10:53 PM

Offline
Oct 2006
507
I just realized that at the 'grand totals' on each list, Xinil already displays each users deviation based on the average of all members. Can that somehow be used instead of the user's mean?
Just throwing that out there in case it's easier/more accurate that way; haven't put too much thought into it.
Jan 27, 2008 10:55 PM

Offline
Mar 2005
3807
scorpedo said:
I just realized that at the 'grand totals' on each list, Xinil already displays each users deviation based on the average of all members. Can that somehow be used instead of the user's mean?
Just throwing that out there in case it's easier/more accurate that way; haven't put too much thought into it.


I thought about that before too. But never was able to come up with a formula.
Jan 28, 2008 8:56 PM
Offline
May 2007
670
Here's some random comparisons between me and other MAL users:



Looking through it there really isn't a huge variety in similarities, which is a bad thing as it's kinda hard to look at the number and now exactly how similar you are. Adding text may help but with that small of difference it will be hard to properly correlate correct words to the scores.
Jan 29, 2008 7:53 AM

Offline
Aug 2006
386
Zarin said:
Here's some random comparisons between me and other MAL users:



Looking through it there really isn't a huge variety in similarities, which is a bad thing as it's kinda hard to look at the number and now exactly how similar you are. Adding text may help but with that small of difference it will be hard to properly correlate correct words to the scores.


Again, the numbers for mine are currently off and should not be trusted. Both systems are going to differentiate from the original, as well as from each other,

The output from my system would be read the same as the previous, with 0 being the best possible score and 9 being the worst. It's possible to convert mine into percentages as well, but for now I believe we're mostly looking for input
#dontcare
Feb 1, 2008 4:00 PM
Overlord

Offline
Nov 2004
5752
I think, unless there's a large amount of disagreement, I'm just going to go with abh's percentage correlation. Sure, it's not as great as aisakku's, but it's easier to implement, and offers a better 'visual' and 'immediate' representation. If you guys disagree and want to get a poll going, go for it. I just want some movement on this.
Feb 1, 2008 5:09 PM

Offline
Mar 2005
3807
sounds good to me
Feb 1, 2008 5:36 PM

Offline
Aug 2006
386
My only real disagreements have been stated. His system is better than the current, so go for it.
#dontcare
Feb 1, 2008 5:53 PM

Offline
Oct 2006
507
aye, abh's system is fine with me.
Feb 2, 2008 5:37 AM
Anime DB Admin
BACK FOR MORE?

Offline
Jan 2007
12974
No objections.

staff.applications  
guidelines.faq 
 

report.abuse  

thx.skittles  
thx.kina 
 

[H+] ³  
Feb 4, 2008 4:06 PM

Offline
Aug 2006
386
As noted in the other thread, Xinil, I recommend putting a higher limit on the number of entires needed to have a compatibility rating done (or one in general if there isn't any right now). Especially with a system like this, it's going to be very inaccurate with only a couple of entires.
#dontcare
Feb 5, 2008 3:00 AM

Offline
Jun 2007
82
First, thanks for implementing my system. As aisakku said there should be at least a certain fixed number of entries for good accuracy. It is also so in my userscript. And thanks to all the people who contributed to this discussion over past few months. Now I am going to work upon a new proposal, an API for MAL, which will enable developing applications for MAL, such as a tracker.
Jun 15, 2014 9:20 PM

Offline
Nov 2011
9206
Does the greasemonkey script currently use aisakku's method?

I was reading in detail until somewhere in page two, at which point I started skimming. Lots of math and I got a bit lazy. Anyway, the gist I got from it is that aisakku's method detects the similarity of scoring patterns between users, which would account for differences in personal rating scale (for example, 6 is "true neutral" for me, despite 5 being labeled "average", which I consider to be mildly negative). If the script uses said method, I would prefer that to the current system.

I'm a bit saddened a poll was never started here (although I wouldn't have been able to participate in it at the time). I feel rating patterns would make for a much better comparison than simple vector comparison, since vector comparison is still tied to the "default" rating system, which many users (including myself) may not abide by.
Jun 15, 2014 11:24 PM

Offline
Apr 2011
522
TripleSRank said:
Does the greasemonkey script currently use aisakku's method?

Yes. And the current system adopts that method (i.e. Pearson product-moment correlation coefficient), which is resistant to simple linear transformation (e.g. ax+b).
Jun 2, 2015 7:42 AM
Offline
Sep 2014
3
abhin4v said we could download the scrip from http://userscripts.org/scripts/show/12746 but that url seems to be broken. Where can I get it now?
Jun 2, 2015 9:33 AM

Offline
Oct 2009
7742
RitaDnz said:
abhin4v said we could download the scrip from http://userscripts.org/scripts/show/12746 but that url seems to be broken. Where can I get it now?

http://userscripts-mirror.org/scripts/show/12746
However, I think Xinil already implemented it natively into the site, so I am not sure you need it
Pages (3) « 1 2 [3]

More topics from this board

» To Make Manga List Updating More Convenient

Saintseiya100 - 2 hours ago

0 by Saintseiya100 »»
2 hours ago

» An option to see which users have favorited a series or characters

JKKH - Oct 7

2 by Shishio-kun »»
Yesterday, 1:07 PM

» new list option - "Maybe"

dailydi - Oct 12

8 by Shishio-kun »»
Yesterday, 7:25 AM

» Yearly Wrapped like MyDramaList ?

ame - Sep 6

13 by ScaryOwl »»
Oct 11, 10:47 AM

» Abusing the Number of Shared Anime and Affinity to Uncover Private Lists

vazae - Mar 7, 2021

6 by Serhiyko »»
Oct 10, 7:01 AM
It’s time to ditch the text file.
Keep track of your anime easily by creating your own list.
Sign Up Login