Reffer madness
- From a 10-6-2008 discussion on #openlibrary on irc.freenode.net
- Reffer madness part I-II. Prefaced by a blog post on the longest now.
August 2008 at 7:20 am great work! ,
Actively attending
brassratgirl : Phoeb. E mako : B. M. Hill jgay : J. Gay _sj_ : SJ. Klein solrize : S. Olrize stargirl : H Wollach
Papers cited
- Example : from Andrew McCallum
- An annotation scheme for citation function by Simone Teufel, Advaith Siddharthan, Dan Tidhar (2006)
- John M. Ziman. 1968. Public Knowledge: An Essay Concerning the Social Dimensions of Science. Cambridge University Press, Cambridge, UK.
- the only book I ever borrowed from the Harvard Physics Library without returning Sj
- Ina Spiegel-Ruesing. 1977. Bibliometric and content analysis.
- John M. Ziman. 1968. Public Knowledge: An Essay Concerning the Social Dimensions of Science. Cambridge University Press, Cambridge, UK.
Social Studies of Science, 7:97-113.
CITE UNSEEN
on classifying academic papers
<jgay> another piece of software, Cora Research Paper Classification [relational document classification] - Research papers classified into a topic hierarchy with 73 leaves. We call this a relational data set, because the citations provide relations among papers.
- err, rather, those are data sets we can use with the osftware
McCallum and Wollach
- Andrew McCallum
- He did Rexa and then here two short descriptions of papers he published last year http://pastebin.com/m18a601a6
- The two essays are "Learning to Predict the Quality of Contributions to Wikipedia" and "Topic Models Conditioned on Arbitrary Features with Dirichlet-multinomial Regression". I think if he can automate 90% accuracy rates with his programs, then he'll know what kinds of citations are good ones.
- Also with Hannah Wallach he did a great paper entitled "Community-based Link Prediction with Text."
- <mako> jgay: hanna mentioned tihs
- <jgay> mako, his more recent work is more relevant, though
on reffing
Different ways of saying the same thing: revisit until all is self-similar and beautiful.
classes of refs
explicitly noting a dependency on a source's legitimacy (usually implying the reference is viewed positively and as a source of accuracy/legitimacy, save in satire or proof by counterexample):
- 'based (in some part) on',
- 'uses as positive reference/proof',
- 'uses as negative reference/proof'
explicitly stating legitimacy:
- 'promotes/supports',
- 'attempts to prove',
exlpicitly stating illegitimacy:
- 'discounts/criticizes',
- 'attempts to disprove'
meta-ref:
- 'cites as transmitter of fundamental cite'
- <sj> there's actually a lot of conflation of proximal reference with original source that goes on when one is lazy or pressed for time leading at times to the wrong people being recognized for discoveries when this was not their intent
- <jgay> _sj_, yeah, that is really common.
- <sj> there's actually a lot of conflation of proximal reference with original source that goes on when one is lazy or pressed for time leading at times to the wrong people being recognized for discoveries when this was not their intent
anti-ref:
- 'presents a different and possibly incompatible perspective'
- 'referred to for research but provided no inspiration for any section'
uses of sources
- "I am relying on this source"
- "I am refuting this source"
- "I found this a source of amusement"
- "this source was in my pile of library books at the end of the day, like the extra screws left over when you're done putting your whatsit back together"
types of cites
- nocite - influential work is used but not referenced or cited.
- noncite - incluential work is referenced in text but not in a cite
- anticite - citing a work to indicate it was read or reviewed as a potential reference, but could not be used anywhere in the work
- fauxcite - a random cite to make a section look better reffed than it is, not related
- selfcite - citing self's work as prior art; one can cite all of one's prior publications if one is godo at this, in each new work
- bibliocite - a cite to indicate a work was part of the reading/background
- middlecite - an intermediary who is citing the underlying original source, but was the work directly read by the author. there can be many layers of middleciting
- poison cite - intended to reframe the real meaning of the cited work; cite doesn't really say what it's imputed to say
- misleading cite - intended to confuse the course of a discussion; cite doesn't affect the argument the way it's implied to
asides
<brassratgirl> _sj_: any citation scheme doesn't fix the having access problem
on why this is useful
<solrize> is this something anyone has cared about in the past N centuries of academic publishing? <solrize> i mean usually one just explains in words how the ref was used
- <brassratgirl> solrize: sure, but there arent' that many people who are actually interested in citation styles
- <_sj_> solrize, it has generally been considered charming that being a good scribe requires massive amounts of time and unique access to exotic journals
<_sj_> brg, I would say that this style standard shoulde xplicitly push for clarity in the cite for two reasons.
- 1) aggregation : you want to be able to combine a number of cites together, or combine cites through a chain of documents this is less possible if your aggregator has to parse natural language to make outall of the possible meanings your citing sentence may have
- ...
- 2) for parallelism : if you rely on natural language to define what you mean by a cite it will be more different among citers than if people have to explicitly pick a style. a style that says "I cite this to indicate I think it is wrong" is explicitly different from a "I cite this because it influenced me [negatively]"
on irc
For discussions like this we really need a tangents/talk channel and a get-shit-done channel.
- every chan should have a get-shit-done channel. what to name it? sj
- ## tends to mean off-topic -jgay
- so what means more-on-topic? sj
- ## tends to mean off-topic -jgay
I AM THE ANTICITES
to come...