Let's Go Tribe!: An SB Nation Community

Navigation: Jump to content areas:


Pro Quality. Fan Perspective.
Around SBN: Can Tebow Say No To Anything?

DiamondView Lite

DiamondView is the name of the software the Indians use to collect and organize data about players.  It's supposed to be as automated as possible to save time and effort.  As the PD put it: “Consider it a kind of truth machine, whose function is to temper, or even override, subjective measurements with objective ones.”  I like the idea of a truth machine and have always wanted one myself.

Now, I'm no expert when it comes to spreadsheets and programming, but with a couple hours, I made some headway toward a DiamondView Lite.

Initial Desiderata

- to automatically keep track of the stats of the Prospects that Matter in a single spreadsheet.
- to upload the information to Google Docs or some such to share with the LGT community.

Implementation so far

Using Baseball Reference's minor league stats and Neo Office, I'm able to import the raw data for each player into a spreadsheet.  I can then select the specific row of information I'm looking for (in this case the basic stats for 2008) and collect it with the other players on one sheet.  I can then parse the raw row information into separate cells.

I'm not sure, but I don't think Google Docs allows extraction of web data, so I can't have the spreadsheet update itself online there.  I can, however, upload the already updated spreadsheet for everyone to see.

Results

1. I've got the 2008 data for most of the hitters on the Prospects that Matter list.  It should be fairly easy to continually update the data as B-Ref updates their stats.  It's not that hard to add or delete players.

2. Three hours lost not working on my dissertation.

Additional Benefits

The stats are sortable and allow comparisons among Indians prospects that ordinarily would be much more laborious to conduct.  The implementation is easily extendible to other stats, game data, and whatever else B-Ref offers. 


Issues for Discussion

1. I'm not sure about the legality of posting the spreadsheet on Google Docs or my website.  I don't, of course, intend to make a profit off the thing, but it might be bad to display B-Ref data elsewhere.  On the other hand, it's really just an extension of what we and many others already use the data for.

2. Would you find this a useful thing to have?

3.  Any additional hints on implementation or requests for features?

1 recs  |  Comment 13 comments

Story-email Email Printer Print

Comments

Display:

sounds bad ass—post that! Only question: what color is this truth machine?

by BrendanGreen on Jul 11, 2008 6:23 AM EDT reply actions   0 recs

1. There are enough other lawyers and law students who post here to tell me I’m wrong (not to mention Chuck), and I do not practice in the field of intellectual property, but I looked at the terms of use on their web site.

They “own or license the data, content, graphics, forms, artwork, and other material for this Site (the “Content”), as well as the selection, coordination, arrangement, and organization and enhancement of the Content (also, the “Content”) for [the] Site.”

Use, reproduction, or distribution of the Content is prohibited, but you can download or print one copy, “for your use in learning about, evaluating, or acquiring SRL’s or its licensees’ or licensors’ services or products,”

Interstingly, the terms of use allows search engines to copy the material to make search indexes of the material but not to cache pages. Appearantly Google has not read that part or does not follow it.

If the data you are getting from B-R is simply information about performance of players – AB’s, Hits, RBI’s, HR’s, etc that is publically available from multiple sources, I don’t think they’re going to come after you for using it as you have and sharing it with other interested people like us. If you’re using something B-R has produced as an analysis of that data, you’re getting into their intellectual property which could produce a problem if they ever knew you did it.

(Keep in mind you’re getting this comment from a guy who’s using a copywritten photo as his avatar.)

2. Hell yes.

3. Not until I see how it works.

"It's hard to win when you don't score." Cliff Lee, 9/28/05.

by Harry Doyle on Jul 11, 2008 11:08 AM EDT reply actions   0 recs

F.S., could just email B-Ref and ask?

by Brick. on Jul 11, 2008 11:26 AM EDT up reply actions   0 recs

But where’s the fun in that?

"It's hard to win when you don't score." Cliff Lee, 9/28/05.

by Harry Doyle on Jul 11, 2008 2:18 PM EDT up reply actions   0 recs

I’ve had to do some work on this. The raw results data itself is public domain like any other hot news. And if I remember by IP class right, raw math can’t be protected either. What B-Ref could probably protect is the code which distributes the data and makes it pretty and accessible and up-to-date. It would make an interesting case as to how much of that was going into FranklinScott’s database.

Of course, from a practical perspective, B-Ref (being, as it is, a couple of dudes) probably doesn’t have the resources to employ a legal dream team, and I feel safe in assuming FranklinScott isn’t a lip-lickingly deep-pocketed defendant. If it were me, I’d sponsor Eddie Mujica’s page and maybe ask nicely.

by fleerdon on Jul 11, 2008 11:49 PM EDT up reply actions   0 recs

I don’t know if the raw results data is entirely public domain. B-Ref pays for those data feeds.

by Jay on Jul 12, 2008 3:22 PM EDT up reply actions   0 recs

My bet is that data feed is the key term there. The fact that events like home runs actually occurred is public. If, for example, B-Ref paid some people to watch every game and keep an electronic scorecard, that would be fine. (Motorola did something similar and survived the lawsuit.) But the question would be, how much of the direct line out of MLB belongs to MLB, and by extension, B-Ref, and for how long? As I said, interesting case, and one I doubt anybody would bother to prosecute.

MLB would love for the data to be proprietary, by the way, because the league would then become a licensor for every fantasy league in existence. Then we could have black market fantasy baseball. Fun!

by fleerdon on Jul 12, 2008 8:30 PM EDT up reply actions   0 recs

B-Ref (being, as it is, a couple of dudes) probably doesn’t have the resources to employ a legal dream team, and I feel safe in assuming FranklinScott isn’t a lip-lickingly deep-pocketed defendant.

but he could assemble a team of lawyers and law students to come to his aid, once again scaring off the dudes at B-Ref.

"It's hard to win when you don't score." Cliff Lee, 9/28/05.

by Harry Doyle on Jul 13, 2008 1:06 AM EDT up reply actions   0 recs

Sean (the B-Ref guy) is a good guy and well-liked by everyone in the baseball blogosphere and many within the actual baseball industry. There is no reason to be ripping him off on the sly just because a couple of people here are lawyers. It’s both poor form and a dumb move.

by Jay on Jul 13, 2008 5:38 PM EDT up reply actions   0 recs

If you look at my original analysis, I thought that Franklin was not violating B-Ref’s usage policy by using raw information within the public domain. I took a different point of view with any analysis that made the information their intellectual property.

I do not support ripping anyone off on the sly (or otherwise), even if you really do have a team of lawyers at your disposal.

"It's hard to win when you don't score." Cliff Lee, 9/28/05.

by Harry Doyle on Jul 14, 2008 2:09 PM EDT up reply actions   0 recs

If you get scared off by the intellectual property, there is another way. Minor league game data is available in XML format similar to MLB Gameday information. Here is the first inning of last night’s Aeros game. There are several publically available XML parsers on the web specifically for the Gameday output that people use for PitchFX analysis that take the XML and put it into a spreadsheet or (even better) a database for manipulation. Someone with programming skills could throw something together fairly quickly. Here, for example, is a Python script that grabs all data for every Braves minor league team for a whole season – changing it to the Tribe is a matter of changing

‘ricaaa’, ‘msbaax’, ‘myrafa’, ‘romafx’, ‘dnvrok’, ‘brarok’, ‘dbrrok’
to
‘bufaaa’, ‘akraax’, ‘kinafa’, ‘lcoafx’, ‘mvsasx’, ‘dinrok’, ‘indrok’

That’s probably more trouble than it’s worth, but it is certainly legal.

by FredOx on Jul 11, 2008 12:21 PM EDT reply actions   0 recs

I wouldn’t worry about anyone coming after you. Worst case is they send you a Cease and Desist and you have to take it down. You are not causing them any harm or making a profit. Don’t worry yourself

by KevinV on Jul 11, 2008 6:20 PM EDT reply actions   0 recs

Comments For This Post Are Closed


User Tools

Constantly updated Indians news, lots of in-depth analysis, live in-game discussions — and more fanatical and thoughtful Indians fans than every other web site combined.
Start posting about the Indians »

Join SB Nation and dive into communities focused on all your favorite teams.

FanPosts

Community blog posts and discussion.

Recent FanPosts

39135485-59af19dbb26654095f910f34176af094_4ae8a81e-scaled_small
Predictions Group
Small
I think we found our utility infielder
Small
Baseball in Japan
3444ant_black_small
Spring Training Trips
Hans_small
Trade Peralta?
3444ant_black_small
Beware the year of the Ox
Bos-fod_small
award-winning independent baseball documentary released
Calavera_small
Create your own 2010 BA Top Ten list (then sign Grady to an extension)
Img_0108_small
Jason Grilli Signs Minor League Deal?
Mariahcareyglitter_medium_medium_small
2009 AL Central Off-Season Transactions and Rumors

+ New FanPost All FanPosts >

Featured Poll

Poll
Who would you like to see hired to manage the Cleveland Indians?
Bobby Valentine
106 votes
Travis Fryman
41 votes
Manny Acta
113 votes
Don Mattingly
78 votes
Torey Lovullo
30 votes
Other
51 votes

419 votes | Poll has closed

FanShots

Quick hits of video, photos, quotes, chats, links and lists that you find around the web.

Recent FanShots

Lee has Foot Surgery
Jamey Wright Signed to Minor League Deal
2010 Cleveland Indians draft preview
Jim Ingraham Angers You.
BP Calls John Hart One of the, STRIKE THAT, BEST GMs of the 90s
Sizemore Addresses Offseason
Can we reach 1,000 on a post about Adam Kennedy?
"At this time, we're looking to make the biggest impact possible on the...
Casey Blake shaved his beard
ICBWDSTGFO550K

+ New FanShot All FanShots >

Sweet Baboo

Sweet Baboo, now with glitter


Managers

427px-nap_lajoie_1913_small Ryan

Dosequisman_small Jay

Authors

3444ant_black_small APV

47b8dd28b3127cceb64839d9746800000026102bauwjrq3za_small afh4