CS 전공/책, 자료들2015. 12. 8. 10:03

http://www.redbook.io/

Readings in Database Systems는 M. Stonebraker가 87년부터 정리해왔던 DB 분야 중요 논문 모음집으로,

 일명 레드북으로 불려지는, 대학원 수업 보조재로 주로 활용되는 책입니다. 

이번에 새로 5th ed.이 나왔네요.

새 집필진에는 Stanford에 새로 합류한 tx 분야의 신성 Peter Bailis 박사가 참여

(라 쓰고, 잡일좀 하라고  두 대가들한테 쪼여 참여한듯한...)

그런데 방식이 좀 바뀌었네요. 

예전에는 논문 묶음으로 보면 되었는데, 간단하게 요약식으로... 

그나저나 집필진들끼리도 의견 통일이 안된 글이 있다는데... 




Posted by Bart
CS 전공/책, 자료들2009. 10. 6. 23:31
최근에 CMP(Chip-level Multi Processor) 또는 Multi-core의 출현-사실 멀티코어의 대두는 frequency wall, power wall, heat dissipation 등 여러가지 이유 등으로 발생하게 되었지만, 자세한 출현 배경은 나중에 언급하도록 하자- 에 따라, 커리큘럼에 parallel programming 또는 multi-core programming 코스를 추가하는 미국 대학들이 많아지고 있다.

Memory bandwidth등의 문제로 아직 8 core 이상에서는  이상적인 성능을 보여주지는 못하는 듯 하지만,
사람들이 예측하기로는 32 core CPU가 시중에 유통될 날도 멀지 않을 것이라 한다.  뭐 인텔은 실험적으로 80-core CPU도 공개한 적이 있다. 
더 나아가 멀티 코어들을 여러개 꽂은 멀티 프로세서 시스템들도 쏙쏙 출현할 것이다. 거기에다 각 Core가 SMT (Simultaneous Multi Threading) 까지 지원한다면, 실제 돌릴 수 있는 시스템 쓰레드의 수는 엄청 많아질 수 있을 것이다. MapReduce 기술과 함께 바야흐로 Reminiscence of Parallelism이다. 
(사실 MapReduce와 Multi-core가 결합되는 예도 있는데, Stanford 애들이 만든 Pheonix라는 MapReduce의 Multi-core Programming Model로의 적용과 같은 연구도 있었다) 

아무튼 이런 저런 이유로 이제는 직렬 프로그래밍(sequential programming) 뿐만 아니라,  Parallel Programming 기법도 손에 익어두어야 할 필요가 있겠다.

아래는 여태껏 조사해 본 멀티코어 관련 강좌들:

1. Multicore Programming Primerhttp://www.youtube.com/watch?v=vhIwuNJzVG4
   MIT Open Courseware(http://ocw.mit.edu/OcwWeb/web/home/home/index.htm에서 제공하는 강좌
   2007년도 강의 비디오와 강의 노트를 제공한다. 
   올해 강의 홈페이지 및 자료는  http://groups.csail.mit.edu/cag/ps3/index.shtml에서,, 그런데 Cell   Broadband Chip을 중심으로 살짝 방향이 바뀐 거 같다

2. Parallel Programming for Multicorehttp://www.cs.berkeley.edu/~yelick/cs194f07/
    UC Berkeley 전산학과 학부  2007년도 강의

3.  Applications for Parallel Programming, http://www.cs.berkeley.edu/~yelick/cs267_sp07/
    UC Berkeley 전산학과 대학원  2007년도강의

4. An Introduction to Parallel Programming& Parallel Programming Tools,  http://www.sun.com/solutions/hpc/development.jsp
   Sun HPC(High Performance Computing)에서 제공하는 Courseware

5. High Performance Computing Training(https://computing.llnl.gov/?set=training&page=index)
  Lawrence Livermore National Laboratory에서 제공하는 강좌들
 Parallel Computing 을 비롯한 HPC 관련 강좌들을 여러 개 제공한다.

written by Bart
Posted by Bart
CS 전공/책, 자료들2009. 9. 12. 09:30
이번에 모 단체에 DB 관련 기술들에 대한 소개나 기술 동향, summary 논문등을 소개하고, 이를 검토하는 일을 맡았다.
 많고 많은 연구 논문들 중에서 너무 깊게 들어가지는 않으면서 그렇다고 너무 가볍지도 않게, 동종 분야의 연구자들이 쉽게 연구동향을 파악하기 좋도록 논문들을 선별해야 하는데 한달에 1건씩 제안할 수 있다니 1년에 12건 정도를 제안할 수 있겠다.

아래는 현재까지 정리해본 리스트:

1. Database Optimizations for Modern Hardware, Proceedings of IEEE, Vol 96(5), May 2008 pp.863-878
2. Breaking the memory wall in MonetDB, Communiactions of ACM, 12/08, pp.77-85
3. Data Management in the Cloud: Limitations and Opportunities, Data Engineering Bulleting, March 2009, Vol. 32, No.1, pp.3-13
4. Integrating NAND Flash Devices onto Servers, Communications of ACM, 04/09, pp.98-106
5. A Comparison of Approaches to large-Scale Data Analysis, In Proceedings of SIGMOD'09 June 09 
   & HadoopDB, In Proceedings of VLDB'09
6. A Survey of Uncertain Data Algorithms and Applications, IEEE TKDE Vol. 21, No.5, 609-623,  May 2009
7. Hard Disk Drives: The Good, The Bad and The Ugly, Communications of ACM,06-09, pp 38-45
8. The Claremont Report on Database Research Communications of ACM 06/09, pages. 56-65.
   SIGMOD Record March 2009
9. The Five-Minute Rule 20 Years Later (and How Flash memory Changes the Rules), Communications of ACM 07/09 pp. 48-59
10. Probabilistic Databases: Diamonds in the Dirt, Communications of ACM 07/09, pp. 86-95
11. The Pathologies of BIg Data, Communications of ACM 08/09, pp. 36-44

위 리스트는 확정된 것이 아니고, 상황에 따라 유동적이다.
다른쪽 세부분야의 논문들도 보면 좋겠는데, 지식의 한계로 본인이 그나마 먼지나 알고 있는 분야에서 논문들이 추려지는 느낌이다.  또 최근에 출간된 논문 내에서만 추려야 하다보니, 예전부터 사람들이 지긋이 파고 있는 분야는 넣기도 또 애매한 면이 있다.

혹, 다른 DB 업자들이 봐두면 좋을 것 같은 최근의(2009년 이후 출간된) introductory또는 survey paper를 가지고 계시다면, 추천해 주세요. 

Posted by Bart
CS 전공/책, 자료들2008. 9. 6. 09:08
사용자 삽입 이미지

Now in its eighth edition, this classic book remains the complete technical-writing reference for students and professionals alike. Alphabetically organized and easy to use, its nearly 400 entries provide guidance for the most common types of professional documents and correspondence, including reports, proposals, manuals, memos, and white papers. Abundant sample documents and visuals throughout the book demonstrate effective technical communication, reflecting current practices for formatting documents and using e-mail. In addition, advice on organizing, researching, writing, and revising complements thorough treatment of grammar, usage, style, and punctuation to provide comprehensive help with writing skills. This edition has been thoroughly revised to include expanded advice for analyzing the context of different writing situations, using and integrating visuals, and dealing with ethical concerns in technical writing. Improved coverage of research now includes guidelines for IEEE-style documentation as well as clearer explanations of copyright and plagiarism concerns. Entries throughout the book have been revised, updated, consolidated, and streamlined to provide the most accurate and accessible information. Comprehensive yet concise, the Handbook of Technical Writing remains the quick reference faithful users have come to appreciate.
http://www.amazon.com/Handbook-Technical-Writing-Eighth-Practices/dp/0312352670

* 요새 읽고 있는 책, 이런 책은 이바닥 입문 전에 읽어 두었어야 되는건데....ㅡㅡ;
Posted by Bart
CS 전공/책, 자료들2008. 3. 25. 17:41

Readings in Database Systems, Third Edition

Edited by

Michael Stonebraker
University of California, Berkeley

Joseph M. Hellerstein
University of California, Berkeley

CHAPTER 1 The Roots

A Relational Model of Data for Large Shared Data Banks, CACM 13(6) 1970
Edgar F. Codd
System R: Relational Approach to Database Management, TODS 1(2) 1976
Morton M. Astrahan, Mike W. Blasgen, Donald D. Chamberlin, Kapali P. Eswaran, Jim Gray,
Patricia P. Griffiths, W. Frank King III, Raymond A. Lorie, Paul R. McJones, James W. Mehl,
Gianfranco R. Putzolu, Irving L. Traiger, Bradford W. Wade, and Vera Watson
The Design and Implementation of INGRES, TODS 1(3) 1976
Michael Stonebraker, Eugene Wong, Peter Kreps, and Gerald Held
A History and Evaluation of System R, CACM 24(10) 1981
Donald D. Chamberlin, Morton M. Astrahan, Mike W. Blasgen, Jim Gray, W. Frank King III,
Bruce G. Lindsay, Raymond A. Lorie, James W. Mehl, Thomas G. Price, Gianfranco R. Putzolu,
Patricia G. Selinger, Mario Schkolnick, Donald R. Slutz, Irving L. Traiger, Bradford W. Wade,
and Robert A. Yost
Retrospection on a Database System, TODS 5(2) 1980
Michael Stonebraker

CHAPTER 2 Relational Implementation Techniques

Operating System Support for Database Management, Database Engineering Bulletin 9(3) 1986
Michael Stonebraker
R-Trees: A Dynamic Index Structure for Spacial Searching, SIGMOD 1984
Antonin Guttman
Generalized Search Trees for Database Systems, VLDB 1995
Joseph M. Hellerstein, Jeffrey F. Naughton, and Avi Pfeffer
An Evaluation of Buffer Management Strategies for Relational Database Systems, Algorithmica 1(3) 1986
Hong-Tai Chou and David J. DeWitt
Join Processing in Database Systems with Large Main Memories, TODS 11(3) 1986
Leonard D. Shapiro
Access Path Selection in a Relational Database Management System, SIGMOD 1979
Patricia G. Selinger, Morton M. Astrahan, Donald D. Chamberlin, Raymond A. Lorie, and Thomas G. Price
Query Rewrite Optimization Rules in IBM DB2 Universal Database
T. Y. C. Leung, H. Pirahesh, P. Seshadri, and J. Hellerstein

CHAPTER 3 Transaction Management

Granularity of Locks and Degrees of Consistency in a Shared Data Base, IFIP Working Conference on Modelling in Data Base Management Systems 1976
Jim Gray, Raymond A. Lorie, Gianfranco R. Putzolu, and Irving L. Traiger
On Optimistic Methods for Concurrency Control, TODS 6(2) 1981
H. T. Kung and John T. Robinson
Concurrency Control Performance Modeling: Alternatives and Implications, TODS 12(4) 1987
Rakesh Agrawal, Michael J. Carey, and Miron Livny
Efficient Locking for Concurrent Operations on B-Trees, TODS 6(4) 1981
Philip L. Lehman and S. Bing Yao
Principles of Transaction-Oriented Database Recovery, ACM Computing Surveys 15(4) 1983
Theo Härder, Andreas Reuter
ARIES: A Transaction Recovery Method Supporting Fine-Granularity Locking and Partial Rollbacks Using Write-Ahead Logging, TODS 17(1) 1992
C. Mohan, Donald J. Haderle, Bruce G. Lindsay, Hamid Pirahesh, and Peter M. Schwarz
The Design of a POSTGRES Storage System, VLDB 1987
Michael Stonebraker
The ConTract Model, Database Transaction Models for Advanced Applications 1992
Helmut Wachter and Andreas Reuter

CHAPTER 4 Distributed Database System

R*: An Overview of the Archtecture, JCDKB 1982
R. Williams, Dean Daniels, Laura M. Haas, George Lapis, Bruce G. Lindsay,
Pui Ng, Ron Obermarck, Patricia G. Selinger, Adrian Walker, Paul F. Wilms,
and Robert A. Yost
R* Optimizer Validation and Performance Evaluation for Distributed Queries, VLDB 1986
Lothar F. Mackert and Guy M. Lohman
Transaction Management in the R* Distributed Database Management System, TODS 11(4) 1986
C. Mohan, Bruce G. Lindsay, and Ron Obermarck
The Dangers of Replication and a Solution, SIGMOD 1996
Jim Gray, Pat Helland, Patrick E. O'Neil, and Dennis Shasha
Mariposa: A Wide-Area Distributed Database System, VLDB Journal 5(1) 1996
Michael Stonebraker, Paul M. Aoki, Witold Litwin, Avi Pfeffer, Adam Sah,
Jeff Sidell, Carl Staelin, and Andrew Yu

CHAPTER 5 Parallel Database System

Parallel Database Systems: The Future of High Performance Database Systems, CACM 35(6) 1992
David J. DeWitt and Jim Gray
The Gamma Database Machine Project, TKDE 2(1) 1990
David J. DeWitt, Shahram Ghandeharizadeh, Donovan A. Schneider, Allan Bricker, Hui-I Hsiao,
and Rick Rasmussen
AlphaSort: A Cache-Sensitive Parallel External Sort, VLDB Journal 4(4) 1995
Chris Nyberg, Tom Barclay, Zarka Cvetanovic, Jim Gray, and David B. Lomet
Coloring Away Communication in Parallel Query Optimization, VLDB 1995
Waqar Hasan and Rajeev Motwani

CHAPTER 6 Objects in Databases

The ObjectStore Database System, CACM 34(10) 1991
Charles Lamb, Gordon Landis, Jack A. Orenstein, and Danel Weinreb
QuickStore: A High Performance Mapped Object Store, SIGMOD 1994
Seth J. White and David J. DeWitt
Client-Server Caching Revisited, IWDOM 1992 (SIGMOD 1991)
Michael J. Franklin and Michael J. Carey
The Database Language GEM, SIGMOD 1983
Carlo Zaniolo
Inclusion of New Types in Relational Data Base System, ICDE 1986
Michael Stonebraker
The POSTGRES Next-Generation Database Management System, CACM 34(10) 1991
Michael Stonebraker and Greg Kemnitz

CHAPTER 7 Data Analysis and Decision Support

Improved Query Performance with Variant Indexes, SIGMOD 1997
Patrick E. O'Neil and Dallan Quass
Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals, Data Mining and Knowledge Discovery 1(1) 1997 (ICDE 1996)
Jim Gray, Surajit Chaudhuri, Adam Bosworth, Andrew Layman, Don Reichart,
Murali Venkatrao, Frank Pellow, and Hamid Pirahesh
An Array-Based Algorithm for Simultaneous Multidimensional Aggregates, SIGMOD 1997
Yihong Zhao, Prasad Deshpande, and Jeffrey F. Naughton
Fast Algorithms for Mining Association Rules in Large Databases, VLDB 1994
Rakesh Agrawal and Ramakrishnan Srikant
Online Aggregation, SIGMOD 1997
Joseph M. Hellerstein, Peter J. Haas, and Helen J. Wang

CHAPTER 8 Benchmarking Database Systems

A Measure of Transaction Processing Power
Anon et al.
The OO7 Benchmark, SIGMOD 1993
Michael J. Carey, David J. DeWitt, and Jeffrey F. Naughton
The Sequoia 2000 Storage Benchmark, SIGMOD 1993
Michael Stonebraker, James Frew, Kenn Gardels, and Jeff Meredith

CHAPTER 9 Vision Statements

Database Metatheory: Asking and Big Queries, PODS 1995
Christos H. Papadimitriou
Database Systems: Achievements and Opportunities, CACM 34(10) 1991
Abraham Silberschatz, Michael Stonebraker, and Jeffrey D. Ullman
Strategic Directions in Database Systems - Breaking Out of the Box, ACM Computing Surveys 28(4) 1996
Abraham Silberschatz and Stanley B. Zdonik

'CS 전공 > 책, 자료들' 카테고리의 다른 글

Introductory or Survey Papers  (0) 2009.09.12
Handbook of Technical Writing  (3) 2008.09.06
Readings in Information Retrieval  (0) 2008.03.25
Jennifer Widom 강의 자료  (0) 2008.02.22
웹 2.0 시대의 기회 시맨틱 웹  (0) 2007.11.12
Posted by Bart
CS 전공/책, 자료들2008. 3. 25. 17:40

Readings in Information Retrieval

Edited by

Karen Sparck Jones
University of Cambridge

Peter Willett
University of Sheffield

Chapter 1 Overall Introduction

Chapter 2 History

The Thesaurus Approach to Information Retrieval
T. Joyce and R. M. Needham
The Automatic Derivation of Information Retrieval Encodements from Machine-Readable Texts
H. P. Luhn
Indexing and Abstracting by Association. Part I
L. B. Doyle
On Relevance, Probabilistic Indexing and Information Retrieval
M. E. Maron and J. L. Kuhns
The Cranfield Tests on Index Language Devices
C. W. Cleverdon
Computer Evaluation of Indexing and Text Processing, JACM 15(1) 1968
Gerard Salton and M. E. Lesk

Chapter 3 Key Concepts

The Concept of "Aboutness" in Subject Indexing
W. J. Hutchins
The Testing of Index Language Devices
C. W. Cleverdon and J. Mills
Thesaurus
D. J. Foskett
Using Problem Structures for Driving Human-Computer Dialogues
P. J. Daniels, H. M. Brooks, and N. J. Belkin
Relevance: A Review of and a Framework for Thinking
T. Saracevic

Chapter 4 Evaluation

A Study of Information Seeking and Retreving. I. Background and Methodology
T. Saracevic, P. Kantor, A. Y. Chamis, and D. Trivison
On Selecting a Measure of Retrieval Effectiveness. Part I
W. S. Cooper
The Pragmatics of Information Retrieval Experimentation, Revisited
J. Tague-Sutcliffe
Presenting Results of Experimental Retrieval Comparisons
E. M. Keen
MEDLARS: Report on the Evaluation of Its Operating Efficiency
F. W. Lancaster
The TREC Conferences
D. K. Harman

Chapter 5 Models

Getting Beyond Boole
W. S. Cooper
A Non-Classical Logic for Information Retrieval
C. J. van Rijsbergen
A Vector Space Model for Automatic Indexing, CACM 18(11) 1975
Gerard Salton, A. Wong, and C. S. Yang
The Probability Ranking Principle in IR
S. E. Robertson
Inference Networks for Document Retrieval
H. Turtle and W. B. Croft
ASK for Information Retrieval. Part I. Background and Theory
N. J. Belkin, R. N. Oddy, and H. M. Brooks

Chapter 6 Techniques

An Algorithm for Suffix Stripping
M. F. Porter
Robust Text Processing in Automated Information Retrieval
T. Strzalkowski
Term-Weighting Approaches in Automatic Text Retrieval, Retrieval., Information Processing and Management 24(5) 1988
Gerard Salton and C. Buckley
Search Term Relevance Weighting Given Little Relevance Information
K. Sparck Jones
Using Probabilistic Models of Document Retrieval without Relevance Information
W. B. Croft and D. J. Harper
Some Simple Effective Approximations to the 2-Poisson Model for Probabilistic Weighted Retrieval
S. E. Robertson and S. Walker
Improving Retrieval Performance by Relevance Feedback, JASIS 41(4) 1990
Gerard Salton and C. Buckley
Using Interdocument Similarity Information in Document Retrieval Systems
A. Griffiths, H. C. Luckhurst, and P. Willett

Chapter 7 Systems

The SMART and SIRE Experimental Retrieval Systems
Gerard Salton and M. J. McGill
Architecture of an Expert System for Composite Document Analysis, representation, and Retrieval
E. A. Fox and R. K. France
User-Friendly Systems Instead of User-Friendly Front-Ends
D. K. Harman
The Okapi Online Catalogue Research Projects
S. Walker
TREC and TIPSTER Experiments with INQUERY
J. P. Callan, W. B. Croft, and J. S. Broglio
RUBRIC: A System for Rule-Based Information Retrieval
B. P. McCune, R. M. Tong, and J. Dean
TARGET and FREESTYLE: DIALOG and Mead Join the Relevance Ranks
C. Tenopir and P. Cahn

Chapter 8 Extensions

A Hypertext Environment for Interacting with Large Databases
M. Agosti, G. Gradenigo, and P. G. Marchetti
Automatic Analysis, theme Generation, and Summarization of Machine-Readable Texts
Gerard Salton, J. Allan, C. Buckley, and A. Singhal
Querying Across Languages: A Dictionary-Based Approach to Multilingual Information Retrieval
D. A. Hull and G. Grefenstette
Experiments in Spoken Document Retrieval
K. Sparck Jones, G. J. F. Jones, J. T. Foote, and S. J. Young
Video Parsing, Retrieval and Browsing: An Integrated and Content-Based Solution
H. J. Zhang, C. Y. low, S. W. Smoliar, and J. H. Wu
The Automatic Indexing System AIR/PHYS - From Research to Application
P. Biebricher, N. Fuhr, G. Lustig, M. Schwantner, and G. Knorz
A News Story Categorization System
P. J. Hayes, L. E. Knecht, and M. J. Cellio
Conceptual Information Extraction and Retrieval from Natural Language Input
L. F. Rau
A Production Rule System for Message Summarization
E. Marsh, H. Hamburger, and R. Grishman
The Application of Linguistic Processing to Automatic Abstract Generation
F. C. Johnson, C. D. Paice, W. J. Black, and A. P. Neal

Chapter 9 Envoi

Historical Note: Information Retrieval and the Future of an Illusion
D. R. Swanson
Posted by Bart
CS 전공/책, 자료들2008. 2. 22. 22:29



http://www.cs.uiuc.edu/calendar/videoarchive.php 에 보면 UIUC에서 Distinguished academic lecture들을 서비스를 한다. 이중에서 Jennifer Widom 아줌마의 Stream Data Management System에 대한 강의.
요새는 서울대 컴퓨터공학부(http://cse.snu.ac.kr)에서도 비슷한 서비스를 하고 있다.

알고 싶은 의지만 있으면, 배움의 소스는 참 많음을 알수 있다. 문제는... 귀찮다는 거지 ㅡ,.ㅡ

* 연사 소개 전까지 몇분간 소리가 안납니다. 오디오가 나간것이 아님.


Posted by Bart
CS 전공/책, 자료들2007. 11. 12. 17:41
김중태 문화원에 공개된 시맨틱 웹
 - 책으로 출판된 내용을 그대로 웹에 게시하였다.(벌이가 되시려는가 약간은 걱정이 된다.)
 - 문화적인 파급효과를 다루지만, 내부적인 기술 부문에 대해서는 크게 다뤄지지는 않은 느낌이다.
 - 그래도 한번 읽어보기엔 괜찮은 책이다.

http://www.dal.co.kr/chair/semanticweb/sw.html
Posted by Bart
CS 전공/책, 자료들2007. 11. 12. 17:37

사용자 삽입 이미지

데이터베이스 컨설팅계의 유명한 이화식 씨께서 이전의 책을 새로 교정하여 내놓은 책입니다.
교재에서는 얻을 수 없는 저자의 경험을 통해 얻어진 많은 내용들이 있더군요.
실제 필드에서 DB를 다루시는 분들이라면 꼭 보시라고 추천하고 싶습니다.
아래의 글을 덧붙여서...
 
"In Theory there is no difference between theory and practice. In practice there is."
Jan L.A van de Snepscheut
Sep. 12, 1953 -Feb. 23, 1994
Posted by Bart